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EDITORIAL 





A Rin §] dl e hile it’s not as historic as, say, Brown versus The Board of Education, or as exciting as Ali versus 
a Foreman’s “Rumble in the Jungle,” the Stallman versus Microsoft tussle has its David-and-Goliath 


Seat 


moments. In one corner, there’s the Redmond tag team of Jim Allchin, Craig Mundie, Steve Ballmer, 
and a squad of ringside support personnel. In the other corner, there’s Richard Stallman. 

In truth, Microsoft isn’t ganging up on Stallman per se, instead dissing the GNU General Public License (GPL). 
But, as any fool knows, going chin-to-chin with the GPL means taking on Stallman. Yo mamma! That said, 
they’re sort of badmouthing the GPL. Microsoft’s real opponent seems to be open source in general, and Linux 
in particular. The confusion is in how Team Microsoft continually muddles the concepts of “open source” and 
“GPL,” referring to them as being one and the same. They’re not— and Microsoft darn well knows it. 

Round One, you recall, started with Allchin stupidly describing open source as “unAmerican.” This 
idiocy was followed in Round Two by Mundie’s presentation 
(http://www.microsoft.com/presspass/exec/craig/05-03sharedsource.asp) at New York University’s Stern 
School of Business, in which he first lumped together GPL and open source, then introduced Microsoft's 
notion of “shared source”— a model that governs the terms under which Microsoft licenses source code to 
its “customers and partners” (http://www.microsoft.com/Business/Licensing/SharedSource/). In fact, there’s 
nothing new about Microsoft’s shared source at all. The company admits it has been sharing source code 
with some customers and academic institutions for years. 

Pursuant to Mundie’s talk, NYU’s Center for Advanced Technology invited GPL author Richard Stallman 
for a contrary view. Setting the stage for Round Three, a Microsoft public-relation minion delegated to the 
Stallman problem encouraged journalists “to take a moment prior to [Stallman’s] speech to read through 
these questions and to look at the GNU GPL FAQ” (http://www.ddj.com/articles/2001/0175/0175c/0175c.htm). 
These questions were graciously provided by Microsoft, of course. 

For nearly two hours, Stallman hammered home that free software and the GPL aren’t the same as open 
source (http://www.gnu.org/events/events.html). Much of Stallman’s discussion focused on the freedoms 
that free— not open source— software offer: The freedom to run programs for any purpose, any way you 
like; the freedom to change programs to suit your needs; the freedom to distribute copies of programs; and 
the freedom to publish improved versions so others can get the benefit of your work. In short, Stallman 
really didn’t offer much new (as someone pointed out on Slashdot.org “Headlines for today: RMS: Free 
Software Good, Dog Bites Man, Sky Blue”). His specific response to Mundie was best summed up in the 
letter “Free Software Leaders Stand Together” (http://www.perens.com/Articles/StandTogether.html), jointly 
signed with Bruce Perens, Eric Raymond, Linus Torvalds, Miguel de Icaza, Larry Wall, Guido van Rossum, 
Tim O'Reilly, Bob Young, and Larry Augustin. 

Opening Round Four, Microsoft CEO Steve Ballmer jumped into the ring with an interview in the Chicago 
Sun-Times (http://www.suntimes.com/output/tech/cst-fin-micro01.html). Without mentioning GPL, Ballmer 
did his best to again muddy the waters: “Open source is not available to commercial companies. The way 
the license is written, if you use any open-source software, you have to make the rest of your software open 
source.” Frankly, I have no idea what Ballmer is talking about— open source is available to commercial 
companies. However, I do understand when he states that “Linux is a cancer.” Of course, why he’d say such 
a thing is another question. 

Although I’ve never met Allchin or Mundie, I have met Ballmer and am here to tell you he is a very, very 
smart guy. Consequently, there’s no question that Ballmer, Allchin, and Mundie clearly understand the difference 
between open source and GPL. So why persist in intentionally obfuscating the truth? One explanation is that it’s 
just business— Microsoft is scared silly of Linux, open source, and free software, and will go to any lengths to 
poison the marketplace well. However, my favorite theory is that this is all a red herring. By bobbing and 
weaving about open source, attention is diverted from other controversial issues— such as licensing. 

Microsoft wants to make major changes in how it sells software, and they’re going to be very hard sells 
indeed. What the company wants to do is treat software as a service, a la the failed IBM MVS model of years 
ago. In the service model, customers lease software from Microsoft, rather than buy it. What makes this 
viable today, however, is distributed computing and the emergence of web services (such as Microsoft’s 
.NET), whereby application software resides on vendor servers rather than local disks. Continuous Internet 
connections let vendors have non-stop access to customer systems for monitoring— and enforcing— license 
compliance. Miss a monthly lease payment, and your access to applications is restricted. While feinting over 
open source, Microsoft is able to slide another agenda in under the radar. 

Round Five should be a doozy when Mundie climbs on stage at the O’Reilly Open Source Convention 
(http://conferences.oreillynet.com/cs/os2001/view/e_sess/1834) to further derogate open source and 


promote shared source. Here’s hoping for a ringside seat. 


Jonathan Erickson 
editor-in-chief 
jerickson@ddj.com 
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Huge GCC Executables 

Dear DDJ, 

Somewhere in an old DD/ column, Al 
Stevens noted how big GCC-produced ex- 
ecutables can be— up to 1.6 MB for a sim- 
ple “Hello World!” compiled with the Stan- 
dard C++! I found this statement on the 
Cygwin FAQ (http://sources.redhat.com/ 
cygwin/faq/faq.html#SEC77): “By default, 
GCC compiles in all symbols [...]. If that 
bothers you, just use the ‘strip’ program, 
part of the binutils package. Or compile 
with the ‘-s’ option to gcc.” 

I guess that this behavior’s purpose is 
to let you identify in which function a pro- 
gram crashed just by loading a coredump 
in a debugger. I don’t know if this is linked 
to the extreme sizes that you noted, but 
I suspect that it is. 

Paolo Bonzini 

bonzi@pc-amo3.elet.polimi.it 


Java Class Loader 

Dear DDJ, 

Thanks for including an article on class 
loaders, “A Java 2 Network Class Loader,” 
by Lorenzo Bettini and Donato Cappetta 
(DDj, February 2001). Class loaders are 
fundamental to understanding Java, and 
are all too often ignored in intro-level ma- 
terials. 

Readers should note that Java does in- 
clude a built-in network class loader, 
java.net.URLClassLoader, so there is no 
need to write your own. Also, the built- 
in URL and URLClassLoader classes let 
you separate the two key tasks performed 
by most class loaders: resource resolu- 
tion and security. The URL class permits 
pluggable protocol handlers, which ab- 
stract away the details of network con- 
nection, as discussed in Brian Maso’s ar- 
ticle “A New Era for Java Protocol 
Handlers” (http://developer.java.sun.com/ 
developer/onlineTraining/protocolhandlers/). 
The URLClassLoader extends SecureClass- 
Loader, which associates property per- 
missions with class loaders. SecureClass- 
Loader is the backbone of Java security. 


10 


However, if you extend ClassLoader instead, 
as the code in the article did, the entire se- 
curity architecture goes out the window. Fi- 
nally, the article’s NetworkClassLoader is in- 
secure, and is a poor abstraction of resource 
resolution when compared to the classes 
that are already part of the core API. Read- 
ers should view the NetworkClassLoader as 
an educational example, not as a recom- 
mendation for how to write or use class 
loaders in any production environment. 
Stuart Halloway 
http://staff.develop.com/halloway/ 


Department of Dumb Ideas 
Dear DDJ, 
Here’s one for your department of dumb 
ideas. In its online catalog (http://www 
.tadioshack.com/product.asp?catalog%5 
Fname=CTLG&category%5Fname=CTLG%5 
F0Q02%5F003%5F018%5F000&product5Fid= 
33%2D2055), Radio Shack is selling a 
digital VU meter. However, the picture 
in the catalog shows the case, not the 
meter. 

John Penry 

jpenry@satx.rr.com 


Setting the Record Straight 

Dear DDJ, 

Richard Stallman would receive far more 
sympathy and positive response if he were 
more courteous and polite (see “Letters,” 
DDJ, April 2001). As it is, most of the com- 
ments, though valid, are simply ignored 
by the community at large. 

The operating system popularly referred 
to as “Linux” will never be called “GNU/ 
Linux.” Get over it, RMS. For as much as 
Linux is no OS without GNU, GNU is no 
OS without a kernel, and HURD is still a 
ways off. Furthermore, it was the Linux 
kernel that drew the large crowd of users, 
not GNU. Linux was the project that es- 
sentially sparked the “Bazaar” model of 
development, leveraging the Internet and 
a large community of developers to rapid- 
ly improve and enhance itself. While GNU 
is an extremely laudable effort, its devel- 
Opment was much more controlled until 
recently. Thus, more people associate the 
entire GNU/Linux system with the Linux 
kernel and the Open-Source philosophy 
than with the GNU operating-system soft- 
ware and the Free Software Foundation 
and its ideology. 

Berating people’s intelligence also 
doesn’t further your cause, nor does 
declaring projects “problems” as you did 
with KDE due to its dependence on Troll- 
tech’s QT under the previous, more re- 
strictive license. 

I respect your abilities, and under- 
stand— though I disagree markedly — 
with your philosophies. I also disagree 
with the continuous, bigoted insistence of 
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adherents of both Free Software and Open 
Source to characterize other operating sys- 
tems as inferior and their manufacturers 
as evil. These operating systems possess 
strengths that neither GNU nor Linux have, 
and fulfill needs that their Free or Open 
counterparts can't. 

Oluseyi Sonaiya 

oluseyi@tweny.tr.com 


The Legacy of C# 

Dear DDJ, 

After reading Marc Eaddy’s article “C# ver- 
sus Java” (DDJ, February 2001), it is in- 
teresting to note that Microsoft is taking 
a page from IBM’s mainframe Common 
Language Environment that supports all 
the programming languages on the IBM 
mainframe OS, such as OS/390. All the 
IBM compilers use the APIs that are in the 
CLE and so produce the same results. IBM 
has addressed the interoperability prob- 
lems quite nicely. Perhaps Microsoft can 
be successful in CLR. 

As for cross-language rather than cross- 
platform, perhaps Microsoft should take 
another leaf from IBM’s notebook. I doubt 
that will ever happen, however, as Mi- 
crosoft’s business model is to sell software 
in a monopoly environment, where only 
it sells the software. Microsoft is com- 
pletely dependent on Windows. Microsoft 
sees no reason to dilute its monopoly by 
entering other platform markets. It had no 
success with Windows NT on the DEC Al- 
pha platform (now Compaq). 

Microsoft has been predatory in its take 
overs of software such as Kerberos. That 
security software is now fractured and the 
promise of cross-platform security is dead 
as far as connecting Kerberos on Windows 
to any other implementation, due to Mi- 
crosoft making a Windows-specific mod- 
ification. 

Without a monopoly on the Internet 
and on Java, Microsoft will not grow. The 
momentum is building in the open OS are- 
na with Linux and Java for cross-platform 
development. IBM is investing US$1B in 
Linux development. It has already invest- 
ed a lot in Java on all its platforms. Where 
Microsoft is single platform, cross lan- 
guage, IBM is multiplatform single lan- 
guage. IBM’s business model is based on 
its hardware sales, and it could care less 
whose software runs on IBM’s hardware. 
It is obvious that Microsoft and IBM are 
at odds in the business arena. Microsoft’s 
success with Windows and the extension 
of Windows-dependent software to the 
Internet and enterprise is to the detriment 
of IBM. 

In a way, I’m in favor of C#. I can add 
yet another language to my résumé and an- 
other certification to the list I have, and pre- 
sumably that makes me a more valuable 
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How Design by Contract 
Could Improve Black-Box 
Testing” 





by Adam Kolawa 






















Design by Contract (DbC) is a formal 
way of using comments to incorporate 
specification information into the code 
itself. Basically, DbC involves expressing 
the code’s specification using a formal 
language that describes the code’s implicit 
contracts. These contracts specify such 
requirements as: 


¢ Conditions that the client must meet 
before a method is invoked. 

¢ Conditions that a method must meet 
after it executes. 

¢ Assertions that a method must satisfy 
at specific points of its execution. 


DbC can dramatically improve the quality 
and speed of black-box (functionality) 
testing, but unfortunately, there is not yet a 
standard for implementing DbC in C++. The 
development of such a standard would be 
beneficial because if your code’s 
functionality were clearly expressed in 
terms of the code’s methods and classes, it 
would be much easier to build a test suite 
that completely and accurately tested 
whether the intended functionality was 
indeed implemented. 

For example, imagine you have a 
method that should return the value of 
a+b, and this functionality is documented 
with a postcondition contract. However, 
this method contains an error: the code 
that should have been written as a+b was 
written aS a-b. Because the code’s 
expected functionality is clearly 
documented, you can easily determine that 
you should test if the method returns a+b. 
If the expected functionality was not 
documented, you would have to look at the 
Specification document or review the 
undocumented code in order to determine 
what functionality to test. If you used the 
first method, you would have to locate and 
read the specification document. If you 
used the second method, you might 
mistakenly decide to test if the method 
returns a-b. Both methods would 
consume a fair amount of time and could 
Cause you to overlook a simple yet critical 
functionality problem. 
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C++Test™ is a C/C++ unit testing tool that can 
automatically test any C/C++ class or function 
as soon as you have compiled it. 























C++Test performs white-box, black-box, and regression testing. 












































Unit testing is widely recognized as one of 
the most effective ways to ensure application 
quality, but until now, performing unit testing 
has been practically impossible. If you 
wanted to perform unit testing, you had 
to perform every step—from building 
scaffolding, to building stubs, to building and 
running test cases—by hand. This process is 
difficult, time-consuming, and expensive. 


C++Test does all of this automatically, with just 
the click of a button. C++Test automatically 
performs white-box (construction) testing, 
black-box (functionality) testing, and regression 
testing on any compiled class or function. This 
means that you can reap the benefits of unit 
testing immediately and effortlessly. 













White-box testing checks the construction of 
your code. C++Test performs white-box 
testing by automatically generating and 
executing test cases to see how files, 
classes, or methods behave when they are 

passed unexpected inputs. 











Black-box testing verifies that 
each class behaves according to 
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FREE DEMO at: 
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ParaSoit’ 
c++test 


Click once for 
automatic C/C++ unit testing! 


“C++Test does everything we wanted to do, without us having to do the work.” 
—Thomas Gielow, Demand Management, Inc. 


specifications. C++Test performs black- 
box testing by automatically generating 
and executing test cases and displaying 
the outcomes, which you can verify with 
the click of a button. You can also enter 
your own black-box test cases to be 
executed by C++Test. 


Regression testing verifies that your 
modifications corrected existing problems 
and did not introduce new problems. C++Test 
performs regression testing by repeating the 
test cases that were run the previous time the 
class was tested. If outcomes have changed 
or if C++Test finds exceptions, it reports 
regression errors. 


Of course, unit testing is just one stage of 
the error prevention process for C and C++ 
developers. C++Test works with ParaSoft’s 
CodeWizard® to automatically enforce coding 
standards each time you build a file. C++Test 
also works with ParaSoft’s Insure++° to 
automatically detect runtime errors each 
time you test a class or method. 


To test automatically at the unit level, 
download a fully-functional demo of C++Test 
today at www.parasoft.com/ddj8, or for more 
information call (888) 305-0041. 
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(continued from page 10) 
resource for the job market. On the oth- 
er hand, I’m paid for success, and the most 
stable platforms for large enterprise sys- 
tems of applications are not Windows. I 
must also contend with interfacing with 
legacy systems, and the cross-platform ap- 
proach is where established enterprise data 
processing solutions succeed best. 
Bruce E. Hogman 
bruce.hogman@eds.com 


DHTML Source-Code Correction 

Dear DD], 

Both DD/J and DDJ.com are good re- 
sources for the programming community. 
I particularly enjoyed the February 2001 
issue, especially the excellent article 
“Cross-Platform DHTML” by Charlie Ma. 
However, there is a typo in Listing Two 
(page 128 in the magazine) that prevents 


the samples from running. The “x” on the: 


last line of the excerpt below (after the 
“+” and before the “=’”) is extraneous and 
stops the code from executing properly. 


<!— StartFragment—> 
if (obj.nodeType == 1) { 
// nodeType==1 for all HTML tags other 
than comments and CDATA. 
msg += “&lt;” + obj.nodeName; 
for (var j=0; j<obj.attributes.length; j++) { 


Microsoft 


if (obj.attributes.itemG).nodeValue) { 

msg += “ “ + obj.attributes.item(j).node- 
Name 

+ x”=”” + obj.attributes|j].nodeValue + ““; 
<!—EndFragment —> 


I learned a lot from the article and the 
samples are cool. 

Greg Bissell 

gerui@artofchina.com 


Shared Libraries Arent All Bad 

Dear DDJ, 

Okay, I'll be the first to admit that it’s bad 
form to reply to a reply to a response. 
However, I feel a slight twinge of moral 
imperative to reply to Jeff Duntemann’s 
reply to Michael Wojcik’s response to Jeff’s 
original assertion that “[tlhe worst idea 
ever to arise in the history of computing 
is shared code libraries” (“Letters,” DD/, 
March 2001). Two things need to be 
sharply pointed out. First, I feel that Jeff 
is obliged at the very least to admit that 
his original statement was far too broad- 
ly stated. After reading his reply, it seems 
that what he really meant to say is that 
the worst idea ever to arise in the history 
of computing is Windows’ implementation 
of shared code libraries. Second, the con- 
servation of disk and memory space is only 


20 percent of the issue, if that much. To 
say that we should get rid of this concept 
of shared code libraries is like saying that 
we should get rid of this concept of sub- 
routines; after all, any modern editor can 
perform a copy-and-paste in a jiff. Any 
programmer who has been too liberal with 
copying-and- pasting will quickly learn the 
error of his ways as soon as he discovers 
a bug in one of those lines of code. 

Michael Hobbs 

hobbs1@acm.org 


Zuse and Intel 
Dear DDJ, 
In his recent “Letter” (DD/J, December 2000), 
Oleg Kiselyov points out that Intel’s ex- 
citement over the “advance” of predicated 
execution is inappropriate, as it was mas- 
sively predated by Zuse. Well, Intel may 
be forgiven for not being aware of the in- 
ner workings of Zuse, but if they are un- 
aware that Acorn’s ARM chip had this fea- 
ture more than a decade ago, they’re even 
more stupid than they are arrogant. 

Ben Laurie 

ben@algroup.co.uk 
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1c Complex Hosting in a Global Environment 


Solid, simple application development. 


The problems you're asked to solve every day are increasingly more complex: 
Management wants it to be faster, more intelligent, more secure and all i 
"Internet" time. At INNERHOST we have the bandwidth, hardware, software 
and expertise to simplify and customize the deployment and ongoing 
application infrastructure management processes. 

From iron-clad hardware to leading edge software and over 50 Microsoft 
Certified Professionals, iNNERHOST is committed to providing the 
premier environment for your physical and logical Microsoft application 
infrastructure requirements. 

Our complex managed hosting and application solutions enable our clients 
to outsource all of their Microsoft mission critical and mission essential I 
infrastructure needs and allow them to capitalize on the latest technologies 
and packaged software applications. For more information, contact us to 
develop your customized service level agreement. 
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The Pay-Per-Use 
Concept 






Imagine the possibilities. 
| LL) Increase you revenue potential by offering your 
- | | software on a pay-per use or metered basis. 


Your customers will enjoy flexible pricing that 
allows them to pay only for what they use, and 
you can set your business apart from others in 
a world of flat-fee software sales. 


Metered software is just one of many possible 
Cr | sales and licensing models that Privilege opens 
for software developers. With Privilege, you can 
offer your software exactly « as your customer 
wants to buy it. 


WE KNOW HOW YOU'LL SELL IT TOMORROW. 


Start selling your software the Privilege way today. 
To order your Software Developer’s Kit, call us at (800) 562-2543, 
or visit our website at: www.eAladdin.com/ddj 


IT’S A NEW WAY TO SELL SOFTWARE. rrnileg e 
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100+ pages of products 
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for Developers! 


Get a FREE Subscription 
to our catalog by calling 







or subscribe at 





* Sold separately. 
Call for pricing on compilers. 


VTune Performance 
Analyzer 
Paradise # 

123 0153-FT 





TX Text Control 8.0 
by The Imaging Source 
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A powerful programming component which 
allows developers to easily add into their 
application sophisticated text formatting and 
display capabilities typically seen only in large 
word processing programs. 









Download a 
demo today. 
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TX Text Control 8.0 Professional Edition 
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VSVIEW® 7.0 Reporting Edition 


by ComponentOne 


ComponentOne VSVIEW Reporting 
Edition adds powerful, flexible 
database reporting to one of our 
most popular ActiveX® controls, 
VSVIEW Classic Edition. Now you 
can quickly and easily create 
Microsoft® Access-style database 
reports for your Visual Basic, VC++, 
and C++Builder applications, plus 
preview, print, and export those 
reports on your computer or over 
the Web. 


Intel® Software Performance Tools 


by Intel 


e VTune™ Performance Analyzer non- 
intrusive sampling and call graph 
profiling offer multiple ways to 
understand code performance. 

e Intel® C++ Compiler* designed from 
the ground up to take full advantage 
of Intel’s latest processors. 

e Intel® Fortran Compiler* delivers 
outstanding application performance 
with advanced features like Profile- 
Guided Optimization. 


Mozquito Factory 1.5 
by Mozquito Technologies 


Mozquito Factory 1.5 brings the strength 
and agility of XML to your computer 

with an easy-to-use editor and a 
transformation engine that translates the 
files into a format all major browsers can 
display. You can now create highly 
interactive Webpages without plugins, 
applets, VB scripting or cross-platform 
worries. Forget DHTML. XHTML and 
Mozquito Technologies’ new tags 

make menus, checkboxes and content 
placement in templates a breeze. 


eWasp Bar Code 


Web Component 
by Informatics 


eWasp Web Components allows you 
to easily create bar codes that can be 
printed from your client’s browser. 
Enhance your website with bar codes 
by creating on-line coupons, tickets, 
check-in confirmations, license 
registrations, rebate mailers, RMA 
packing slips, invoices, electronic 

gift certificates, and much more. 





or call 800-445-7899 











FarPoint’s Spread 


3.5—New Version! 
by FarPoint Technologies 


Use FarPoint’s Spread 3.5 to easily 
incorporate high-level spreadsheet 

een and advanced grid features to your 
setae? You reat unmatched flexibility at the cell level for 
maximum control over the display and entry of your data, plenty 
of events to respond to user changes and an impressive feature 
list for everything in between; including import/export capabilities, 
enhanced printing, 12 cell types, formulas, 
and an improved Spread Designer. 
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by Softwise Company 
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Integrate bug tracking over the Internet 
and a local area network in one database 
with the same easy-to-use interface. 


PR-Tracker’s simple yet powerful 
m interface requires no training and 
fe enables you to configure a project in 
about 15 minutes. Features include: 





10-24 Users 


Paradise # customizable data entry, workflow, 
5 views and queries; one click queries; 
saceeheeetll attachments; and email notification. 
* Price per user. 
TestTrack™ 
by Seapine Software™ 


TestTrack Pro and Workgroup are the most 
powerful defect tracking solutions for Web, 
_| Windows, Macintosh and Linux. TestTrack 
_..| Pro 3.1 with XML delivers a feature-rich 

< — solution to any user group by providing 
precise tracking abilities including: defect tracking; feature request; 
customer and user test configurations; e-mail notification; e-mail bug 
importing; duplicate defect handling; release note generation; and un- 
limited customer care. TestTrack Workgroup 2.0 delivers a rich feature 


set for smaller teams. 
2.0 Workgroup Ed. : 3.1 Professional Ed. 
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Office XP Developer 
Edition Upgrade 


by Microsoft Corporation 


The Office XP applications include a 
broad set of new features and services 
that extend the types of custom Office 
solutions that can be built for both 
the enterprise and the Web. Office XP 
Developer provides the tools and 
features to enable developers to 
make use of this new functionality, 

as well as enhancements that ensure 
developer productivity. 
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RoboHelp Enterprise is intelligent 
server-powered Help software 
that provides: 


e User feedback reports to update 
and improve the effectiveness of 
your Help system 


Superior natural language search 
¢ Support for fast project merging 
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DevTIrack 
by TechExcel 


DevTrack is the premier defect- and 
project-tracking tool for software 
development teams, helping to ensure that 
development projects finish on time and on 
budget. DevTrack comprehensively tracks 
and manages all defects, change/feature 
requests, and all other development issues. 
DevTrack also provides powerful workflow 
and process automation features, robust 
searching and reporting, and comprehensive 
point-and-click customization. Intuitive and 
powerful, DevTrack provides a scalable out- 
of-the-box solution, at a great value. 


c-tree Plus® 
by FairCom 





Paradise # 
T34 0170-FT 


Windows Linux ¢ Mac ¢ Novell 
Lynx ¢ QNX @ AIX @ FreeBSD @ HP UX 
SGI ¢ Solaris © Tru64 ¢ Many more! 

This well-known data management 
package is the tool of choice for serious 
database developers. Low-level and 
high-level ISAM functions offer an 
unprecedented level of control. Build fast, 
royalty-free single-user and multi-user 
applications, or utilize the optional FairCom 
database Server. ODBC and Crystal 
Reports drivers available. Small footprint, 
cross-platform support, full source code. 
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ie ~Borland Delphi 6 
: Professional Edition 
E Upgrade 

by Borland 


Delphi 6 makes rapid e-business 
development a snap three times with 
BizSnap™, WebSnap™, and DataSnap”. 
Using BizSnap, business to business 
integration with Web Services has 
never been easier. With WebSnap, 
bring the power and speed of RAD to 
your Web application development. 


Borland 
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RoboHelp® Enterprise : 


: RoboHelp Office is the fastest 

: and easiest way to develop full- 

: featured Help systems and online 

: documentation. RoboHelp gives 

: you the unique ability to create one dyes. 

: Help project, and with a few mouse - 
: Clicks, generate any Help format. 
: Create Help Systems for: 


e Windows and Cross-platform applications 
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Should you see one of these products listed at 

a lower price in another ad in this magazine, 

CALL US! We'll beat the price, and still offer 

our Same quality service and support! 

“Terms of the offer: 

¢ Offer good through August 31, 2001 

¢ Applicable to pricing on current 
versions of software listed 

¢ August issue prices only 

¢ Offer does not apply towards 
obvious errors in competitors’ ads 

@ Subject to same terms and conditions 

Prices subject to change. 

Not responsible for typographical errors. 
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e Websites and Web-based applications. 



















































Visual SlickEdit® v6.0 
by MicroEdge, Inc. 


Source code analysis, side-by-side 
file comparison, extensive language 
support, easy to implement and 
configurable to your coding styles. 
New Java capabilities including 
Java Tool Integration for setting 
compile, execute, debug, javadoc 
and jar options. 
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1 PowerBuilder 8 
| Enterprise 
by Sybase 


PowerBuilder 8 is the easy-to-use, scalable, 
and proven rapid application development 
environment that provides a cost-effective 
migration path to next-generation archi- 

{ tectures while simultaneously protecting 
existing investments. It eases Web and 
distributed development by providing 
seamless application server integration 
through full development, testing, 
debugging, and deployment processes. 


SYBASE 


INFORMATION ANYWHERE. 


Multi-Edit 9 
by American Cybernetics 


Multi-Edit 9, the professional 
programmer's text editor, is now 
packed with more features than ever 
before. New features include: Visual 
Studio Integration; ColdFusion 
Integration; improved Project Manager: 
language support for Python, SQL, 
Progress, REBOL; and more. 
Experience enhanced PERL support 
and improved handling of numerous 
embedded script languages. 
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Orbix 2000" 


Your CORBA e-Business Infrastructure 


Orbix 2000, available for C++ and Java programming 
environments, is the next generation of IONA’s CORBA-based 
matcecel(caueclacwe o)e-lace) daniel liiiane)amr- lm oy-)<-dalenolcialellat-manrecelelt-le 
architecture, the Adaptive Runtime Technology™ (ART). 


Because today’s large-scale distributed applications are subject to constant 
change, they require a standards-based, scalable infrastructure that provides 
sophisticated configuration and management facilities. Orbix 2000 meets these 
needs, conforming to the industry's latest standards for software interoperability, 
while offering new patent-pending Adaptive Runtime Technology™ (ART) that 
delivers increased scalability, flexibility and performance for distributed applications. 


Orbix 2000's Portable Object Adapter (POA) supports a virtually unlimited number 
of objects, and supports enterprise CORBA services such as Interoperable Naming, 
SSL-based Security, Portable Interceptor Specification, Asynchronous Messaging 
Interfaces, and Persistent State Service. 


Visit IONA’s website, www.iona.com, to learn more about 
Orbix 2000. 
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Curl Programming 
Contest Winners Announced — 
Friedger Mueffke and Nikhil Damle ae . 


been awarded top prizes in the Curl pro- — 
‘gramming contest, sponsored by Curl — 
Corp. (http://www.curl.com/). Mueffke © 
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- The World Wide Web Consortium 1 (W3C) 
ia formally issued the XML Schema as a __ 
_ W3C Recommendation, meaning it is sta- 
ko ble, contributes to web interoperability, 

al Ss Ion / and has been reviewed a the W3C, mem- 
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best match between a user’s preferences" 


(stated in high-level terms) and the ser- 
a vices it finds available. For more infor- 
mation, see http://ana.lcs.mit. edu/anaweb/ | 
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The XML Schema spccificnicn ae 


of three parts. The first defines a set of 
_ simple datatypes that can be associated _ 

with XML element types and attributes; _ 
this is intended to allow XML software to _ 
do a better job of managing dates, num- 
bers, and other special forms of informa- 

. | fon, The second part of the specification 
proposes methods for describing the struc- 
a ture and constraining the contents of XML 
documents, and defines the rules govern- 
ime schema validation of documents. The 
third part is a primer, which explains what 


. schemas are and how they are built. - 


XSV, the XML Schema Validator, has 


| ; _ been. revised to validate against ee 
If P - recommendation. The W3C is also solic- 
iting sample schemas from developers for 


a test suite library to be managed by the 
d W3C XML Schema Working Group, 
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Ken White 


n the late 1800s, Thomas Edison and George Westinghouse 

became embroiled in what has become known as “The War 

of the Currents.” Edison had invested heavily in infrastruc- 

ture, supporting the use of direct current for the distribution 
of electricity. Westinghouse, having bought patents on the in- 
ventions of Tesla, advocated alternating current. For almost a 
decade, early adopters of electricity were forced to endure a 
spectacle that included, among other things, public electrocu- 
tions of large animals (performed by Edison to show the dan- 
ger of alternating current; he tried to popularize the slang “West- 
inghoused” as a synonym for “electrocuted”). For a short period 
of time, there were two sets of infrastructure that operated un- 
der different assumptions about how power should be trans- 
ported and consumed. Fortunately, the technology was young 
and the infrastructure was immature, so the cost of competing 
standards was relatively low. Even so, consumers of electricity 
were forced to make an expensive choice, balancing the in- 
vestment in existing infrastructure against superior technology. 

Fast forward about 100 years to Christmas Day, 1981. Under 
the tree waiting for me was a new TI 99/4A home computer. 





Ken is principle database architect at Pervasive Software. He can 
be contacted at ken.white@pervasive.com. 
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With 16 glorious kilobytes of memory and a blazing 3-MHz 
processor (not to mention a built-in Basic interpreter), the 
99/4A was a future techie’s dream come true. Like most com- 
puting machinery, the components inside the 99/4A required 
direct current. Since my house was wired for alternating cur- 
rent at the time (at 60 Hz, no less), this resulted in a mismatch 
of assumptions. The good news was that through amazing 
foresight, the engineers at Texas Instruments had included with 
the computer a transformer, which was capable of adapting 


the power from the outlet for use with the 99/4A. A year or 
so later, TI discovered the possibility that if the transformer 


malfunctioned, a user touching the metal case of a 99/4A could 
be Westinghoused. You can only imagine Edison screaming, 
“I told you so!” as TI spent the cost of distributing replacement 
transformers. 

Data is the power that runs a modern business. When con- 
sumers of that power have different assumptions, a transfor- 
mation is required. Very bad things (including the loss of in- 
frastructure investment) can happen if that transformation is 
not carefully planned. Currently, there are at least three major 
data management paradigms—ISAM, SQL/Relational, and XML— 
in use, with XML poised to explode. Each has different as- 
sumptions regarding how data models the organization’s view 
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of the real world. When different models try to operate on the 
same source of power, they must reconcile these differences. 
The process of reconciling these assumptions can result in data 
loss, performance degradation, system fragility, or feature un- 
availability. The ideal solution is for every consumer to enjoy 
native and natural access to the power source, without one mod- 
el compromising another. 


Database Management Systems 

Data management can be thought of as having two principal in- 
gredients— a data model and a system for implementing that 
model. The data model is responsible for describing the princi- 
ples by which the data will represent the outside world. This 
model is implemented through a database management system 
(DBMS). Over time, DBMS has 
come to mean a software pack- 
age that handles database man- 
agement requirements. 

Because data is the power for 
business, DBMSs (and their asso- 
ciated processes) tend to evolve 
more slowly than other, less 
mission-critical segments of com- 
puting infrastructure. In fact, 
database management models 
have remained relatively un- 
changed through the emergence 
and explosion of the Internet. It 
usually requires a significant shift 
in business practice to engender 
any change in database management. Distributed computing 
(such as web services) appears to be one such shift; it is grow- 
ing rapidly and many major vendors expect distributed com- 
puting to be the next major paradigm in application develop- 
ment. XML is establishing the system for distributed data 
management, and this system does not neatly fit with the as- 
sumptions made by existing infrastructure. 

Much as there are some fundamental electromagnetic prop- 
erties that are the same across different types of current, there 
are some fundamental data management issues that are addressed 
by all data models. Abstracting these common issues is the first 
step to understanding the costs of data model differences, as 
well as possible remedies. The next step is to determine the as- 
sumptions and rules each model places around the basic ab- 
stractions; those assumptions and rules are the things that don’t 
always match. The points at which the assumptions differ not 
only help to describe the problem, but also serve as the first 
step for understanding and evaluating available solutions. Fi- 
nally, the mismatched assumptions and the shortcomings of the 
current solutions serve to help define what a future solution 
might look like. 


Database Concepts of Concern 

The first assumptions involve three relatively basic (and famil- 
iar) database concepts: entities, attributes, and relationships. (AI- 
though some consider the Entity Relationship model to be a data 
model, I don’t treat it as such here. For one thing, there is no 
significant implementation of E-R as a model separate from the 
other models I discuss. The value of E-R for purposes here is 
that it provides useful terminology for abstracting the common 
elements of the models that will be discussed.) In addition, each 
model addresses collections, operational efficiency, and the re- 
lationship of these concepts to the rest of the computing envi- 
ronment. There are other issues involved with data management 
(concurrency, operation atomicity, relational integrity, and so 
on), but these issues are not necessarily differentiating factors 
between models. 
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The first basic concept is that of entity— the thing that is being 
stored and is representative of something in the external world, 
such as a customer, invoice, or inventory item. It may be thought 
of as the most granular representation of data that retains context. 
The question of what exactly comprises an entity is more frequently 
resolved through data/business analysis than through application 
of the normalization formulas found in database textbooks. 

The second concept is that of attribute—a descriptor of an 
entity. If you have a customer entity, you can use attributes to 
keep track of important descriptors, such as a person’s favorite 
games or whether he collects butterflies. Depending on your par- 
ticular prejudices, you may think of attributes as fields or columns. 
Attributes rely on the entity for context. Is the attribute “Zip Code 
= 11111” meaningful by itself? Does it refer to a vendor or a cus- 
tomer? Which vendor or customer? 

The third concept is that of rela- 
tionships. A customer entity and 
three order entities are useless in a 
business process unless you have 
some way of making their relation- 
ship persistent; that is, some way of 
denoting that the order entities were 
made by the person represented by 
the customer entity. In relational the- 
ory, this can be represented by for- 
eign key relationships. 

Obviously, a database would 
not be of much use if it allowed 
you to have one customer entity, 
one order entity, and one inven- 
tory entity. It would be almost as useless if it did not let you ac- 
cess the collection of customer entities independently from the 
collection of order entities. Inherent in a database paradigm is 
the grouping of each of these three concepts: Who are all of 
my customers? What are the descriptors that I’ve stored for a 
particular customer? What other entities are important in rela- 
tion to this customer and how do I access them? 

Although a flat text file could act as the basis of these con- 
cepts, the practical requirement is that specific information can 
be quickly and efficiently retrieved. It is critical that the perfor- 
mance does not degrade as more entities are added. This per- 
formance requirement is most often met through the use of in- 
dexes or keys. 

Finally, there is the relationship between the database system 
and the rest of the computing environment— in particular, the 
operating system and application. Early on, the database paradigm 
was represented by a set of procedures and coding standards 
that dictated how a particular shop’s application code interacted 
with operating-system code. The evolution of database systems 
has seen at least one consistent trend: the abstraction of the 
database paradigm from application and operating-system con- 
straints and the encapsulation of that abstraction within a database 
infrastructure. In other words, relational database application de- 
velopers typically no longer worry about the offset and length 
of a particular attribute or the particular OS file that contains the 
attribute data. Those issues are abstracted within a database in- 
frastructure that generally is viewed as, if not a black box, then 
a really, really dirty one with very tiny windows. While this trend 
is common throughout computing and may seem self evident, it 
has implications for the future state of data management. 


Stages of Database Evolution 
Database technology has evolved through several stages, in- 
cluding ISAM, SQL/Relational, and XML: 


ISAM. Although ISAM has not been formally standardized as 
a data model, thanks to the dominance of Cobol and the effect 
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of that dominance on database management, there is a com- 
mon set of well-understood expectations for an ISAM DBMS. 
In the ISAM paradigm, entities are records. Attributes are un- 
derstood to be data stored starting at a specific offset for a spe- 
cific length. The application is responsible for maintaining rela- 
tionships, usually performed in much the same way as the 
relational model, where entities are collected in OS files, and 
the application (and thus the developer) is responsible for know- 
ing which set of records is in which file. The application can in- 
clude multiple types of records in one file, but any differences 
in entity type within a file must 
be implemented, understood, and 
maintained by the application. The 
DBMS does not understand any 
distinction between different en- 
tity types within the same entity 
collection. 

Efficiency is achieved through 
the use of indexes. Since the 
DBMS is responsible for main- 
taining index information and the 
DBMS does not make any dis- 
tinction between entity types with- 
in an entity collection, an ISAM 
file indexes the same attributes for 
an entire collection. This can result in added responsibility for 
the application if multiple entity types are in the same collec- 
tion. Furthermore, since the DBMS is unaware of any nonin- 
dexed attributes of the entity, the same entity can be viewed as 
having several different compositions, and there is no guaran- 
tee that the attribute indexed by the DBMS is an attribute that 
is meaningful to the application. (For instance, an index can con- 
tain pieces of several attributes, which then will act as a prima- 
ry key, albeit one that has no true meaning in the real world.) 

An ISAM application acts as if it is operating on the physical 
representation of the record (which it is, in most implementa- 
tions). As a result, much of the database management of the 
ISAM paradigm is closely tied to both the operating system and 
the application. 

SQL/Relational. From a theoretical standpoint, the SQL 
paradigm and relational model are not synonymous. In fact, SQL 
can be used to build result sets that do not meet relational re- 
quirements. However, the average computing professional is not 
interested in purely theoretical DBMSs, and when most people 
use a relational database, they are almost invariably using SQL 
to manipulate the data (whether directly or under the covers, 
as is often the case with ADO). 

In the practical relational view, entities are represented by 
rows. Collections of entities are represented as tables (or R- 
tables, or relations, if you want to be exact). Unlike the ISAM 
practice, the application has no control, knowledge, or concern 
with the interaction between entity collections and the operat- 
ing system. Attributes, represented by columns, are understood 
by the system, and all entities (rows) within the same collec- 
tion (table) must share the same attribute composition (columns). 
The application has no knowledge about the physical structure 
of attributes within the storage of the entity. All of the applica- 
tion’s knowledge of entity composition must conform to the 
DBMS’s view of the data. Collections of entities and attributes 
may be arbitrarily defined at run time through SQL predicates 
(as well as through views). Relationships are persisted in much 
the same way as the ISAM model (the constraints, such as pri- 
mary key uniqueness, are formalized in the relational model, 
but the ISAM model is similar in practice). 

Indexes are also present in the relational paradigm, and ap- 
ply to the entirety of the system-defined entity collection (not 
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necessarily to derived relations). The system’s understanding of 
indexes is coupled with the system’s understanding of attributes, 
so unlike ISAM, an index is always composed of attributes that 
are meaningful to the application. 

To summarize, the relational model abstracts the database 
from the operating system and to some extent from the appli- 
cation. There is no longer an exploitable interaction between 
the operating system and the DBMS. Furthermore, while the ap- 
plication may have foreknowledge of the database composition, 
it is incapable of using that knowledge in a manner that is not 
understood beforehand by the DBMS. An application can also 
be written that derives all of its in- 
formation about the database at 
run time, which is certainly not 
the case with the ISAM paradigm. 
This abstraction frees the applica- 
tion and the database administra- 
tor from a number of concerns re- 
garding the internals of data 
management, but it also demands 
that the application conform to 
the expectations of the model. 

XML. XML is not a database 
paradigm. It is a portable persis- 
tence mechanism. However, in a 
distributed computing environ- 
ment, XML fragments may be viewed as small, disconnected 
databases because the integration task required at the receiving 
end is equivalent to integrating data from two different database 
paradigms. Each of these fragments (or minidatabases) is self- 
containing and conforms to a set of rules. Additional rules have 
been added by the introduction of XML schema, but these are 
not necessary to the existence of the XML fragment — it can be 
understood by an XML parser independently of the additional 
rules. The additional rules are used primarily to ease the 
impedance mismatch that occurs when a system of loosely struc- 
tured data is incorporated into a system of more highly struc- 
tured data. 

From a data perspective, entities are represented as elements 
within XML, and attributes (surprisingly enough) as element at- 
tributes. Relationships are perhaps even more important in XML 
than in the other models. While there are additional methods to 
describe relationships, any given entity represented by XML is 
enlisted in one primary relationship — its place in the hierarchy. 
An entity cannot be represented in a document without this re- 
lationship. Collections of entities are generally worked on as if 
they were files. Collections of collections of entities (that is, a 
database) are not defined by XML proper, as standard XML makes 
the assumption that at any given time there is only one collec- 
tion of immediate concern. In a striking departure from both 
ISAM and relational paradigms, entity composition may differ 
across instances of identically typed entities; that is, two cus- 
tomer descriptions within the same document may look com- 
pletely different and still be equally meaningful. The composi- 
tions and types of the entities that make up a particular collection 
are understood by the system. 

This model is tied less tightly to the application than other 
models in that a consumer of the data does not need to rely on 
either application-specific knowledge or a DBMS to understand 
the entities, attributes, and primary relationships that are repre- 
sented by XML data. 

As Table 1 shows, ISAM, SQL/Relational, and XML each make 
different assumptions. Thus, if an application works on a col- 
lection of entities, it must apply a model to that collection and 
operate in conformance with that model. Incorporating a dif- 
ferent model into an existing infrastructure requires that the dif- 
ferences in these assumptions be addressed. 
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(continued from page 24) 

Working with Data Across Paradigms 

It is not uncommon for businesses to have both ISAM and SQL 
applications running side by side; the expected adoption of dis- 
tributed computing requires that data can be used across all 
three paradigms as seamlessly as possible. Much like the AC- 
DC conversion that takes place inside a transformer, the differ- 
ences in assumptions often result in a series of transformations 
that adapt data from one paradigm to another. There are already 
several available mechanisms for accomplishing this task, al- 
though no single method is without drawbacks. Much of the 
data architecture in Microsoft’s .NET initiative is based on the 
expectation that distributed computing must be integrated with 
existing data centers. It is essential to understand the trade-offs 
that are inherent in each approach. 


¢ The pipeline approach (aka mapping). Currently, the most 
widespread solution is a pipeline approach of mapping and 
transformation. This works well when the destination has few- 
er constraints than the source (as is the case with most ISAM- 
SQL pipeline approaches), as it is relatively easy to add con- 
straints and assumptions to a system that is less constrained. 
The process becomes more difficult when building a less con- 
strained system on top of a more constrained system, as is 
the case with the ISAMSQLXML pipeline. 

There are various implementations of this, but the basic 
idea behind this approach is as follows: You and I decide that 
our businesses need to share data and that XML is the medi- 
um of choice. I expose operations that you can invoke over 
the Web to retrieve an XML document, and before you retrieve 
it, I have to be able to describe to you what I’m going to re- 
turn and what it means. Before you invoke my operation, you 
have to determine how you are going to integrate my data 
into your database in a meaningful way. (The odds against us 
having a similarly structured database are roughly the same 
as Gates and Ellison joining forces to support the open-source 
movement.) You then have to be able to map my XML out- 
put to conform to your expectations. If I put customers’ names 
together and you store the first and last name separately (for 
instance), then you have to be able to break a name apart be- 
fore committing it to your primary data store. 

Once our initial contract is made and kept, you have 
additional problems to consider. How do you maintain the 
integrity of the XML transaction for audit-trail purposes? 
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This means not only understanding hierarchical relation- 
ships in your central data store, but also maintaining the 
exact ordering within the hierarchy. The biggest problem, 
though, is what happens when my lead developer leaves 
and I have to hire another one, and she prefers a different 
schema? What do you do with data you don’t understand? 
What do you do with data that doesn’t gracefully fit into 
your database? As the system scales outward, the likelihood 
of problems increases. 

Although each of these problems can be ameliorated some- 
what by individual fixes, there are several problems that are 
inherent with this solution. First, since the data is massaged 
into the more constraining system before it is stored, the trans- 
formation is either lossy or generates a great deal of secondary 
data that must be understood and maintained by the system 
(and anticipated by the system administrator). Furthermore, if 
the data does not conform to the expectations of the relational 
model, it cannot be stored at all. This is the basis of the em- 
phasis placed on XML validation. (Validation doesn’t really 
solve the problem, but allows the application to spot prob- 
lems earlier than it would otherwise.) Finally, there are po- 
tentially numerous transformations that must occur, and in a 
true pipeline approach, each one of these must be treated in- 
dependently. This requires computing resources and results 
in accumulative latency times. 

The .NET framework is an example of the pipeline ap- 
proach. The architecture proposed by Microsoft has a SQL 
database (which generally performs pipelined ISAM atom- 
ics) returning query results as a disconnected, in-memory 
database. This database can then be transformed as need- 
ed or viewed as XML or as a recordset. The power in this 
is the flexibility for developers; the weaknesses involve the 
performance and concurrency issues of the multilayered dis- 
connected approach, the mapping that must be performed 
beforehand, and the inability to adapt quickly to changes 
without losing data. 

Document storage. Another way to try to solve the 
impedance matching problem is the document-centric ap- 
proach of storing an XML document as a BLOB within a 
database. This means that as a document comes in, it gets 
stored as a document within an identifying record. One ad- 
vantage to this is that it maintains the integrity of the doc- 
ument. This system can also be enhanced so that indexing 
(or even full-text search capabilities) can be applied to the 





Table 1: Comparing ISAM, SQI/Relational, and XML. 
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document, so that it can be rapidly retrieved in part or in 
whole. One problem with this approach is that because the 
storage mechanism stays true to the document model of 
XML access, it incurs all the nonDBMS-friendly problems 
that exist with current XML practices. For example, con- 
currency issues get much more complex because as far as 
the DBMS is concerned, the XML work is being done on 
the most granular piece of data it is capable of storing (an 
attribute of an entity). How does a system effectively and 
efficiently protect part of an attribute? It cannot be done 
without changing the way the system views an attribute. In 
other words, this would require a redefinition of a funda- 
mental system implementation to overcome pragmatic de- 
sign shortcomings (a practice known as “kludging”). Fur- 
thermore, since the document is persisted as a document, 
making changes to one section of the document may force 
a rewrite of the entire document, which will invoke some 
of the same performance penalties as writing the document 
out to the file system through OS calls. 

Native storage. One of the methods of solving these problems 
that has received a great deal of attention in the industry is 
that of true XML storage. Typically, an XML document is added 
to the database and the DBMS is responsible for maintaining 
document integrity, relationships, and document granularity. 
Of the solutions, this is the most natural way to deal with XML 
because the data storage mechanism is written to address the 
assumptions and constraints of the XML paradigm directly. In 
a sense, this may be considered the opposite of the pipeline 
approach. Whereas the pipeline approach attempts to force 
XML data to conform to the relational paradigm, the native 
storage approach uses the XML paradigm as the reference 
database storage paradigm. 








Usually, native storage systems will directly address all 
of the problems mentioned in the other mechanisms. The 
problems with this method lie in the other direction: How 
do you still use the business-critical data you have that is 
currently stored by the ISAM or relational paradigm? Can 
you afford to convert your infrastructure? This approach 
assumes that the XML paradigm is your reference model. 
If you decide to convert all of your data, you will likely 
have problems with the expectations of your legacy ap- 
plication. Remember, the three paradigms have several ar- 
eas in which they have significantly different expectations 
of the application. 

Most XML databases have connectors between the XML 
data store and a relational data store, whereby data may be 
accessed from the relational store. However, this solution means 
that there are two databases— one for XML data and anoth- 
er for relational data. Like the pipeline approach, this re- 
quirement results in mapping strategies (which by their very 
nature are resistant to change). It may be difficult for tools to 
query across the two databases and at the least will invoke 
performance penalties in addition to map maintenance. Ad- 
ditionally, since incoming business transactions (such as in- 
voices) will presumably be coming in through XML, there are 
integration issues that cannot be avoided; at some point, some 
of your existing applications will need to view the XML data 
relationally, and that data must act as part of the same database 
as your legacy relational data. 

Architectural issues. With all the solutions currently being of- 
fered, how do you know which is right for you? Much of the 
decision involves your need to communicate with legacy 
databases. If you know that your need to interact with XML 
data is isolated from your need to work with your legacy re- 
lational data, then an XML database solution most directly 
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addresses the paradigm in which you plan to work. The most 
common cases, though, involve integration of XML data with 
legacy databases. Inertia would suggest that the majority of 
adopted solutions are going to be based on the current so- 
lution offered by the vendor of the existing legacy store. 

The problem with this is that some of the existing solu- 
tions, such as the pipeline approach, will experience scala- 
bility problems that may not be apparent upon initial de- 
ployment. While it may be easy to handle the relatively few 
schema changes I face now (working with only my largest 
partners), what is the likelihood that I will need to scale out 
and down? In other words, is it a competitive advantage/ne- 
cessity for my business to be able to interact with more part- 
ners and smaller partners — partners who may not be able to 
afford the same level of schema consistency that I have? If you 
are in a situation where you plan to work extensively with a 
relatively few number of partners, you will likely be able to 
rely on the contract and the pipeline approach may work 
smoothly. If not, the pipeline approach may introduce a quan- 
tity of potential failure points that is unacceptable. 

You should also take into account the number of trans- 
formations that are going to be required. Although most 
operating-system vendors are getting better at scaling through 
clustering (Microsoft in particular has made claims to the 
ability of .NET to scale out), reliance on clustering for per- 
formance improvement is often misguided. In addition to 
administration issues, clustering often introduces perfor- 
mance penalties in gaining additional capacity, a tendency 
that makes clustering a questionable solution for solving 
performance problems. An application that will operate ex- 
clusively on distributed data would be better served by be- 
ing architected around its transformation needs than by the 
traditional approach of being architected around the data- 
storage mechanism. 

Other considerations include the need for an audit trail 
(rendering a solution with lossy transformations unacceptable), 
concurrency needs, transactional needs, and whether the pri- 
mary operations on the data will be reads or writes. If data is 
going to be referred to often but modified rarely, the document- 
centric BLOB approach may be optimal. 


What Will Postrelational 

Databases Look Like? 

There are any number of visions about the future of data stor- 
age. However, based on the history of database technology and 
practice, it would seem that a vision should assume at least the 
following trends: 


e Databases will continue to abstract their operations from ap- 
plication and OS dependencies. 

e Existing infrastructure investments will demand that data stored 
in different paradigms can be accessed equally by all con- 
sumers. 

¢ Database models will be introduced with little frequency, but 
new ones will be introduced based on business needs. 

e Because they are based on business pressures rather than 
purely theoretical technical decisions, new database models 
will not necessarily conform to the assumptions and constraints 
of previous models. 

e There are a number of database concepts that stay relatively 
untouched across models, even as the assumptions and con- 
straints around those concepts change. 


The desired evolution of DBMSs, therefore, could take the 
form of a Postrelational DBMS that decouples the storage mech- 
anism from the access mechanism. One way to think of this is 
that there is an underlying DBMS that operates only in terms 
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of very generic database concepts, analogous to an abstract 
class in object-oriented programming. A DBMS implementation 
could be derived from this base DBMS that applied the con- 
straints and assumptions of a particular model to the data. Ad- 
ditional DBMS implementations could also be placed on top of 
the base DBMS, all having native and natural access to the data. 
Another way to look at this is to think of a data model as a type 
of view— it is a user’s perception of the data. The underlying 
storage mechanism does not need to necessarily conform to that 
view internally, but only to ensure that the view remains con- 
sistent for the consumers of that view. | 

While it is true that each implementation would have the ad- 
ditional overhead of the opaque nature of the underlying DBMS 
interface (and would thus not be able to take advantage of po- 
tential performance efficiencies gained by tight coupling), each 
derived DBMS (or view) would be equally native and act as a 
peer rather than a predefined stop in a pipeline. Thus, a busi- 
ness could run applications that used different data models con- 
currently (even within the same application), all of which run 
with similar and deterministic performance, scalability, and re- 
liability characteristics. 

Will this happen? It’s difficult to imagine it happening rapid- 
ly. The relational model has been dominant for several Internet 
generations, and many database implementations are inextrica- 
ble from the SQL/relational model. Likewise, many of the XML 
solutions are suffering from the growing pains that the relational 
database solutions went through a long time ago. However, as 
the need for efficient distributed data management grows both 
in numbers and in importance, it is likely that there will be a 
competitive advantage in having a solution such as this, re- 
gardless of how sophisticated the pipeline approach becomes. 
If that is the case, then we can rely on the axiom that comput- 
ing nature abhors a competitive vacuum. 


Conclusion 

Organizations have a great deal of investment in their infras- 
tructure incorporating ISAM and relational models. There are 
a number of competitive advantages that can be gained by 
distributed computing (such as web services), and the com- 
mon language of distributed computing is XML. The prob- 
lem is that the XML model makes different assumptions about 
data than the ISAM and relational models. The result is that 
businesses are now tasked with adapting existing infrastruc- 
ture to a new, incompatible data model more quickly than 
ever before. 

There are several ways to accomplish this, but each has draw- 
backs. Some of these drawbacks are more likely than others to 
only show up as the system scales outward; other drawbacks 
are more obvious. Therefore, it is essential that the integration 
of distributed data not merely coast along the path of least re- 
sistance, but that it proceed in the manner best suited to the 
needs of the business. 

There are several possible futures. As unlikely as it seems, 
distributed computing may turn out to be a fad. One partic- 
ular mechanism of model adaptation may improve to the point 
where it satisfactorily addresses the needs of most business- 
es. One of the models may evolve to accept the assumptions 
of the other models, making it the reference model. On the 
other hand, the importance of distributed computing may 
simply force businesses to accept the cost of inefficient data 
transformation. An ideal solution is a DBMS that can apply 
the constraints of any particular model to the underlying data, 
allowing existing infrastructure to perform at current levels 
while providing native and natural support for new models 
as needed. 
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he .NET framework is Microsoft’s an- 

swer to Java and the other technolo- 

gies (such as Sun’s Open Net Envi- 

ronment) for developing web-enabled 
applications. In the process, .NET offers 
a number of features aimed at simplifying 
previously difficult tasks. Classes that en- 
able threading (that is, allowing a single 
process to contain multiple independent 
threads of execution) is one of those ar- 
eas where the .NET framework offers the 
first real opportunity for many program- 
mers to use an advanced feature of the 
operating system. 

Unfortunately for programmers coming 
from Microsoft Visual Basic, Visual Basic 
for Applications, or VBScript, using mul- 
tiple threads can be a little like running 
with scissors. Nothing bad happens most 
of the time, but then, when you least ex- 
pect it— bang! A simple fall can have dis- 
astrous consequences. In this article, I'll 
examine what’s involved in using threads 
and provide background on the problems 
that can occur. 


Uses of Multiple Threads of Execution 
Why might you want to use multiple 
threads of execution? There are two gen- 
eral classes of applications that benefit 
from using threads. 


e The first is a client application that needs 
to do some potentially slow processing 
while still allowing users of the appli- 


Doug is the author of Inside Server-Based 
Applications, moderator of the Database 
topic area on http.//www.ddj.com/, as well 
as owner of Access Microsystems Inc., a 
small consulting company. Doug may be 
reached at doug@accessmicrosystems.com. 


30 





cation to continue working in the ap- 
plication. Printing a document or report 
while users continue editing is the clas- 
sic example of this. 

Server-based applications are the sec- 
ond example of apps that benefit from 
multiple threads. For instance, if there 
are two parts of a process, one relatively 
quick and the other relatively slow, it 
might make sense to have a single 
thread handle the relatively quick task, 
and dole out the results of that task to 
a pool of worker threads that will each 
perform the slower operation. A recent 
example in my experience is an appli- 
cation that reads data from a database, 





does some processing, and sends e-mail. 
Reading the data from the database is 
relatively fast, and doing the processing 
and sending the e-mail (with the po- 
tential for timeouts or errors) is relatively 
slow. A single thread can read the 
database and assign individual threads 
the task of processing the information 
and sending the e-mail for each request. 


Potential Problems with 

Threaded Applications 

If threads are so useful, why not use them 
all the time? First, it is possible that if you 
use enough threads, the application will 
spend more time switching context from 
thread to thread than doing the actual work 
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of the application. Generally, machines run- 
ning the .NET framework will have only a 
single processor. To give the appearance 
of running multiple threads (or applica- 
tions) simultaneously, the processor must 
switch from one context to another quick- 
ly. So, look at the C# code in Listing One. 

While you would expect each line to 
be executed in turn without interruption, 
it is, in fact, possible that some other 
thread or process could get to do some 
of its work between the creation of the 
array and the first initialization. In this ex- 
ample, it is not a problem for these op- 
erations to be interrupted. 

However, imagine the slightly different 
scenario in Listing Two. In this case, you 
can still be sure that the value printed for 
i[3/ will be 3. Now, taking on faith that the 
code created the threads, imagine the 
modification in Listing Three. 

In this example, i/3/ should always equal 
3. But because of the way that this code is 
constructed, there is no guarantee that it 
will equal three. Since initializelt() and 
readIt() are both operating independently, 
there is no assurance that initializelt() will 
get to the assignment 7/3/ before the thread 
running readit() prints out the value. 

The situation would be even worse if 
both threads were writing the value simul- 
taneously, rather than one thread writing 
the value and the other reading the value. 
Still more troublesome is the case of a com- 
plex type. For instance, given the structure 
in Listing Four, when assigning values to 
the individual char elements of an instance 
of the COLOR structure that is accessible 
from multiple threads, the COLOR structure 
could be read between the assignment of 
red and green. Thankfully, there are objects 
within the .NET framework that make this 
threading work cleanly. 


“NET Framework Threading Concepts 

There are several different problems with 
respect to controlling multiple threads. The 
first, which I described in the previous ex- 
ample, is control of access to shared re- 
sources. Other problems include coordi- 
nating operations between threads. For 
instance, a common model for creating a 
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server application is to create multiple 
worker threads, then have those threads 
process requests as needed. This requires 
a different type of operation than merely 
serializing access to global variables. 

If you are familiar with the existing 
Win32 threading concepts and functions, 
a cross-reference to the new .NET thread- 
ing concepts and functions might be use- 
ful. Table 1 presents such a cross-reference. 

The first thing that you should notice 
is that, as with the rest of the .NET frame- 
work, all operations are methods of ob- 
jects, as opposed to the straight Win32 
API calls. In general, the object method 
names are well thought out and reason- 
able. An exception is the oddly named 
Join method on the Thread object, which 
is used to allow one thread to wait for an- 
other thread to terminate. 


A Threading Example 

To understand .NET threading, let’s look 
at a class that does C# thread pooling in 
Listing Five. This class currently uses the 
Sleep method of the Thread class to sim- 
ulate work. There are 10 threads created, 
and there are then 100 calls to dispatch 
some work to a thread. The worker thread 
uses thread synchronization to set up sec- 
tions in the thread that would allow data 
to be shared. Listing Six shows the same 
class using VB.NET. 

Within the .NET framework, external 
functionality is made available in different 
ways than in other languages. In C++ for 
example, it is common to use some ex- 
ternal library. You would #include a head- 
er file and link in a static library that ei- 
ther contained the functionality directly or 
called an external DLL. In Visual Basic, ei- 
ther supplying a prototype for an external 
function in a DLL or adding an external 








component to the References of a project 
included external functionality. In C#, the 
Using keyword allows external functionali- 
ty to be referenced. And in VB.NET, the Jm- 
ports keyword does approximately the same 
thing. The argument to either the Using or 
Imports command in their respective lan- 
guages is a namespace that should be im- 
ported. In Listing Five, the similar names- 
paces System and System.Threading are 
used. This is the way that Java imports ex- 
ternal functionality. But in the case of Java, 
ending the import with an .* will import 
all classes further down in the hierarchy. 
There is no similar “include everything be- 
low this” syntax in C# or VB.NET. 

As with all C# programs, everything is 
part of a class, even the entry point to the 
program, called public static int Main(string/] 
args) in the ThreadTest class. One im- 
portant factor of Main() being a static 
member of the class is that there are no 
instance variables available. In this exam- 
ple, when the Main() static method of 
ThreadTest is first entered, there is no in- 
stance of ThreadTest that has been creat- 
ed yet. The first thing that Main() does is 
create an instance of ThreadTest and de- 
clare a variable of type Thread. 

Once the instance of ThreadTest is cre- 
ated, Listing Five enters a loop to create 
10 threads, each using the WorkerThread() 
method of the 7hreadTest instance just cre- 
ated as the thread entry point; see Listing 
Seven. 

As mentioned in Table 1, creating a 
thread in the .NET framework is a two- 
step process (as opposed to the single- 
step CreateThread() in the Win32 API). 
First, the thread is created with the new 
keyword, and a newly created instance of 
ThreadStart is passed to the Thread con- 
structor. The 7hreadStart constructor has 
been passed the object method that you 


Table 1: Win32 to .NET threading concepts conversion. 
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wish to use as the entry point for the new 
thread. In this example, I am not keeping 
track of each thread object as it is creat- 
ed. In the real world, the Thread object 
instances would be stored in an array so 
that they could later be controlled, and so 
that Main() could ensure that all threads 
are terminated before Main() is exited, 
effectively killing off any existing threads. 
A thread is not actually doing anything 
until Start() is called on the instance of 
the thread. This is truly just a method to 
start a thread. Once a thread is dead, it 
cannot be restarted using Start().. 

Once the for loop is exited, all 10 
threads are active and have started to ex- 
ecute. In this case, because of the way 
that the WorkerThread() method is cre- 
ated, all the threads will wait as one of 
their first actions. If, instead, a thread need- 
ed to have some other action take place 
after creation but before execution, the 
call to the Start() method of the class 
could be delayed. 


The Worker Thread 

Before going any further in the Main() 
method, look at the WorkerThread() 
method, which is designed to do the actu- 
al work of the class. The name of the 
method is unimportant and could have been 
any legal method name, but I call it Work- 
erThread() for clarity. Other features of the 
method are quite important. In the Win32 
world, if you want to use a class method 
as the entry point for a thread, the method 
must be a static method. Nonstatic meth- 
ods of a C++ class (and, presumably, a C# 
class) have an unseen this argument that is 
passed to the method as a first parameter. 
The this pointer is used to allow the method 
to use instance data for the particular in- 
stance of the class it is called upon. The 
opposite is true for thread entry points: They 
must be nonstatic members of the class. 

Further, unlike the Win32 ThreadProc 
that takes a single argument of a void 
pointer (a pointer to anything, really), the 
object method used as an entry point for 
.NET applications has no argument. Pre- 
sumably, there is a way to pass context 
information to a thread entry point; but 
given the current state of the .NET frame- 
work documentation, it is not clear ex- 
actly how that is handled (see the ac- 
companying text box entitled “Data Slots 
and Beta Software...”). For the Work- 
erThread() method in this 7hreadTest 
class, the thread is designed to perform 
multiple loops over and over again with 
a different context each time, so the lack 
of a clear way to pass in context as the 
thread is started is not critical. 

A consequence of this change in the 
entry point function is that new threads 
have a concept of this— meaning that in- 
stance variables are available within the 
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thread. The availability of the instance data 
is useful, but of course, access to the in- 
stance data must be guarded. Looking at 
Listing Five, the first thing that the Work- 


erThread() method does is: 


int th=0; 
lock(cs) 
{ 
threadNum++; 
=threadNum; 
} 


A local integer variable is declared (th) 
and the Jock() statement is used to seri- 
alize access to a section of code. In this 
case, I am setting a variable local to Work- 
erTbread() to the relative number of this 
thread. The lone argument to Jock() is an 
object that lets you control the granulari- 
ty of locking. For example, whenever one 
thread is within the block of code fol- 
lowing the aforementioned /ock() state- 
ment, no other thread can enter a block 
of code with the same object as the ar- 
gument. However, if there was code such 
as the following elsewhere in the class: 


lock(this) 
{ 
threadNum++; 
th=threadNum; 
} 


the block could be entered, since the block 
is locking on a different object. Doing this 
would be a bad idea, since the variables 
being manipulated here are the same as 
the variables being manipulated in the 
previous block locked on cs, but the abil- 
ity to lock on different objects can be use- 
ful in other scenarios. Programmers will 
commonly use the this object for locking 
if they are only locking one set of vari- 
ables. Always think carefully about the 
amount of work you do inside a syn- 
chronized section of code, as well as what 
level of locking granularity you need. For 
instance, if you always lock on the this 
object, the class will be serialized through 
these sections. That is, the code within 
the locked sections will only be execut- 
ing one thread at a time. If large sections 
of code are locked in this way, the multi- 
threaded application may perform no bet- 
ter than a single threaded application. In 
Listing Five, I use the cs object (declared 
and instantiated as an object) just to re- 
mind you how similar this is to the Criti- 
cal Section from the Win32 API. The 
VB.NET equivalent of lock() is SyncLock. 

What is really happening when lock() 
or SyncLock is called? Listing Eight is 
roughly equivalent to Listing Nine. Why 
might you ever use the underlying Sys- 
tem.Threading.Monitor object? While 
lock() is convenient, there are times when 
the extended features available in the full 
object might be required. For instance, 
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System.Threading.Monitor has a set of 
TryEnter and Wait methods that let a 
thread test to see if the Monitor can be 
entered without blocking, or with block- 


ing for a finite period of time. When cre- 
ating a server application, you need to be 
able to avoid locking forever so that the 
server can end based upon a user’s com- 
mand or some other external event. 
Next in Listing Five, WorkerThread() en- 
ters a loop controlled by the value of a flag, 
bRunning, that tells the thread when it 
should end. Inside this loop, there is an if 
statement that uses the AutokesetEvent ob- 
ject (a member of the class); see Listing Ten. 
One of the most misunderstood syn- 
chronization objects in Win32 (and likely 
to be misunderstood in .NET) is the event 
object. An “event” is exactly what you say 





it is, no more. So, within the Main() 
method of our example, you are in a loop 
and you set the event object using the 
Set() method. All this means is that you 
are signaling — telling any waiting threads, 
“Hello! I have just set this event!” Iam not 
sure what would have been a better name 
than “event,” but it adds to the confusion 
because folks presume that there is some 
tie to Windows events, such as a window 
opening. 

In Win32, an event object is created 
using CreateEvent(), and whether it is a 
manual reset event or an automatic reset 
event, it is controlled by a flag to the cre- 
ate function. In .NET, there are two sep- 
arate but similar objects— one called Sys- 
tem.Threading. ManualResetEvent and the 

(continued on page 36) 
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other System. Threading.AutoResetEvent. 
The difference is that the manual reset 
event remains signaled until it is reset man- 
ually. The auto reset event is automatical- 
ly reset when a single waiting thread is re- 
leased. One, and only one, thread will be 
released, even if multiple threads are wait- 
ing on the event. 

The behavior of the AutoResetEvent is 
perfect for what Listing Five is trying to 
do— parcel out work to one, and only 
one, waiting thread. Thus, when the 
thread calls the WaitOne() method on the 
event object with a timeout of 1000 (1000 
milliseconds or 1 second), if the main 
thread sets the event and this thread is re- 





leased, it can be sure that it is the only 
worker thread being released. If the Wait- 
One() method returns False, then the main 
thread did not set the event in that one 
second, or some other threads were re- 
leased by signaling. If no thread is wait- 
ing, then the event will remain signaled. 
In Listing Five, all I do is display a mes- 
sage on the console that lets you know if 
the thread was signaled or if it had to 
sleep. Note that Listing Six contains a 
VB.NET version of Listing Five. 


What Else Is in 

-NET, and What Is Missing? 

Thread pooling, introduced in Windows 
2000, is available in .NET, and it has some 





of the same limitations as far as fine con- 
trol over the number of threads in the 
pool. This has caused me to shy away 
from it. Virtually every threading primi- 
tive and API is somehow represented in 
.NET. Threads in .NET can also be either 
Foreground or Background threads. The 
difference between the two is that once 
only Background threads are running, 
the run time will kill them by throwing 
a ThreadAbortException. Background 
threads can be used to execute code that 
is useful only within the context of some 
other foreground operation that will be 
running on a Foreground thread. Once 
the last Foreground thread is terminated, 
the Background threads alone will not 
keep the runtime alive and will thus be 
terminated. 

Given the rich object model available 
for threading, you might think that ev- 
erything from Win32 is in .NET. That is 
true, with a few exceptions. There is still 
some uncertainty as to how context can 
be passed to threads, but I believe that 
this is just a documentation oversight. 
There is another minor omission as well: 
the lack of a spin count equivalent. In 
Win32, for multiprocessor machines, you 
can set a spin count to allow a thread 
waiting on a critical section to briefly de- 
lay (some time controlled by the spin 
count) if a critical section is owned by 
another thread. This is done in the hope 
that the critical section will be available 
after the brief delay, saving a relatively 
expensive trip to access the underlying 
kernel object. This is not a terrible omis- 
sion, and in any event, it can be replaced 
by developers if required. 


DDJ 





Listing One 
Int[] i = new int[4]; 
i[@]=0; 


e e 
Listing Two 
namespace ThreadingArticle 


using System; 
public class Class1 
{ 
int[] i; 
public Classi () 
{ 


i = new int[4]; 
} 
public static int Main(string[] args) 
{ 


Classi c = new Classi(); 
c.initializeIt(); 
c.readIt() 
return Q; 
} 
public void readIt() 
{ 


System. Console.WriteLine("i[3]={0}",i[3]); 
public void initializeIt() 


i[@]=0; 


i[1]=1; 
i[2]=2; 
i[3]=3; 
} 
} 


e e 
Listing Three 
namespace ThreadingArticle 


using System; 
using System.Threading; 
public class Class1 
{ 
int[] i; 
public Class1() 
{ 


i = new int[4]; 


} 


public static int Main(string[] args) 
{ 


Classi c = new Classi(); 


Thread t 


null; 


// Create a thread to run initializeIt() 


t = new 


Thread(new ThreadStart(c.initializeIt) ); 


t.Start(); 


// Create a thread to run readIt() 


t = new 


Thread(new ThreadStart(c.readIt)); 


t.Start(); 
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return @; 
} 
public void readIt() 
{ 
System.Console.WriteLine("i[3]={0}",i[3]); 
} 
public void initializeIt() 
{ 
i[@]=0; 
i[1]=1; 
i[2]=2; 
i[3]=3; 
} 
} 
} 
e e 
Listing Four 
Struct COLOR { 
char red; 
char green; 
char blue; 
} 
° e e 
Listing Five 
namespace ThreadTest 
{ 


using System; 
using System.Threading; 


public class ThreadTest 
{ 
public System.Threading.AutoResetEvent hEvent; 
// Object to lock() on. 
public Object cs; 
public int threadNum; 
public bool bRunning; 
public ThreadTest () 
{ ° 
bRunning=true; 
hEvent=new System.Threading.AutoResetEvent (false) ; 
cs=new Object(); 
threadNum=0; 
} 
public void WorkerThread() 
{ 


int th=0; 

lock (cs) 

{ 
threadNumtt; 
th=threadNum; 

} 

while ( bRunning==true ) 


if (hEvent.WaitOne(1000,false)==true) 


{ 
lock (cs) 
{ 
System.Console.WriteLine("Thread {%} Got Handle, 
and could safely get variable",th) ; 
} 
} 
else 
{ 


Thread. Sleep (1909) ; 
System.Console.WriteLine("Thread {@} Slept...",th); 


} 
} // End of while (bRunning==true) 

}  // End of WorkerThread() 

public static int Main(string[] args) 

{ 

// We use this to get an object reference to this method. In the 


// olden days, we would use a static method, because non-static methods 


// have magic, invisible "this" pointers. 
ThreadTest b=new ThreadTest(); 
Thread t=null; 


// If this was doing work we cared about, would have an array of Thread 


// objects, and then use .Join() method to ensure they had exited... 
for (int loop=@ ; loop < 10 ; loopt++) 
{ 
t = new Thread(new ThreadStart (b.WorkerThread) ) ; 
t.Start(); 
} 
for (int loop=0;loop<100;loopt+) 
{ 


Random r = new Random() ; 
// Just to make it seem like some work was happening... 
Thread.Sleep(r.Next(1,20@)); 


// We lock on cs. The WorkerThread() will, after returning 
// successfully from wait, try to acquire lock as well. I will 
// not try and set event again until thread is done with that 
// section, presumably, after having read some data. 
lock (b.cs) 
{ 
// Muck with data that is shared, and then set event... 
// should check if this returns true or false... 
b.hEvent.Set() ; 
} 
} 
b. bRunning=false; 
return 9; 
} // End of Main() 
} // End of Namespace 
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Listing Six 

Imports System. Threading 

Module Module1 

Sub Main () 

Dim b As New ThreadTestVB() 
Dim t As Thread 
Dim tloop As Integer 
Dim r As Random 
Dim ts As ThreadStart 


t = Nothing 

For tloop = 1 To 10 
ts = New ThreadStart(AddressOf b.WorkerThread) 
t = New Thread (ts) 
t.Start() 

Next 

r = New Random() 

For. tloop = 1 To 100 
Thread.Sleep(r.Next(1, 2@0)) 
SyncLock (b.cs) 

b.hEvent.Set() 
End SyncLock 
Next 
End Sub 
Public Class ThreadTestVB 

Public Dim hEvent As AutoResetEvent 

Public Dim cs As Object 

Dim threadNum As Integer 

Dim bRunning As Boolean 


Public Sub New() 
bRunning = True 
hEvent = New AutoResetEvent (False) 
cs = New Object() 
threadNum = @ 

End Sub 


Public Sub WorkerThread () 
Dim th As Integer 
SyncLock (cs) 
threadNum = threadNum + 1 
th = threadNum 
End SyncLock 
While bRunning = True 
If hEvent.WaitOne(1000, False) = True Then 
SyncLock (cs) 
System.Console.WriteLine( _ 
"Thread {@} Got Handle and could safely get variable", _th) 
End SyncLock 
Else 
Thread. Sleep (190) 
system.Console.WriteLine( _"Thread {0} Slept", th) 
End If 
End While 
End Sub 
End Class 
End Module 


e e 
Listing Seven 
for ( int loop=@ ;loop<1@ ; loopt+) 
{ 


t = new Thread(new ThreadStart (b.WorkerThread)) ; 
t.Start(): 


Listing Eight 


System. Threading.Monitor.Enter (this) ; 


try 
{ 
// code here 
} 
finally 
{ 
System. Threading.Monitor.Exit (this) ; 
} 


Listing Nine 


lock () 
{ 
// code here 
} 
e e 
Listing Ten 
if (hEvent.WaitOne(1000,false)==true) 
{ 
lock (cs) 
{ 
System. Console.WriteLine( 
"Thread {@} Got Handle, and could safely get variable", th); 
} 
} 
else 
{ 
Thread. Sleep (1090) ; 
System.Console.WriteLine("Thread {0} Slept...",th); 
DDJ 
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The Strix Object 








Relational databases 
meet object-oriented 
languages 








César A. Gonzalez Perez 


espite modern APIs and interfaces, 
using relational databases with 
object-oriented languages continues 
to be problematic. Strix, a software 
layer that works between an application 
and the relational database backend that 
stores its data, was designed to address 
this problem. Instead of interfacing di- 
rectly with the database engine, the ap- 
plication talks to the Strix run time (SRT) 
which, in turn, interacts with the database 
through “store managers” (SM). Store man- 
agers are necessary because different 
database products speak different lan- 
guages (at least, different dialects of SQL) 
and offer different capacities (such as 
transactioning, cursor types, or syntactic 
rules for identifiers). Store managers are 
dynamically loaded and unloaded by the 
SRT as needed. The interaction between 
the SRT and the store managers takes 
place though an intermediate layer origi- 
nally named “Strix Run-Time/Store Man- 
ager Interface” (SRTSMD, which provides 
extra services such as tracing the flow of 
information between the application and 
the database backend, or providing an API 
for you to write new store managers. Fig- 
ure 1 illustrates the Strix architecture. 
This database structure can be accessed 
through the SRT and a suitable backend- 
specific store manager, and requests to cre- 
ate, modify, retrieve, and delete objects can 
be issued from client programs. The Owl 
query language (which closely resembles 


César is founder and leader of Neco, a com- 

pany specializing in software development 
support and OO methodologies in Santiago 
de Compostela, Spain. You can reach him 
at cesargon@neco-ti.com. 
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SQL) can perform arbitrary object-oriented 
queries. Finally, the Scops software layer, 
which operates on top of Strix, can re- 
compile the specified class model with 
some more information to generate lan- 
guage-specific wrappers for the corre- 
sponding classes, attributes, and relation- 
ships. Using Scops on top of Strix lets you 
forget about the existence of a persistence 
engine and live in a “total persistence” 
world, where every object created is per- 
sistent unless specified otherwise. 

Strix evolved from research into repre- 
sentational systems and metainformation at 
the Laboratory of Archaeology and Cultur- 
al Forms (http://www- gtarpa.usc.es/) at 





the University of Santiago de Compostela, 
and Verdewek (http://www.verdewek 
.com/), the research division of Neco 
(http://www.neco-ti.com/). The BlueMoon 
project attempts to integrate Strix, the Metis 
methodology, hierarchical enumerators, 
emerging user interfaces, and substruc- 
tured storage into usable products. 


Strix Concepts 

Two components play special roles in 
Strix: the metainformation manager (Mi- 
Man) and the generic information man- 
ager (GiMan). MiMan implements object 
stereotypes (such as class, attribute, and 
relationship) and builds an in-memory net- 
work of objects representing the structure 
of the system described by the develop- 
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Persistence Engine 


er. GiMan, in turn, implements domain- 
level concepts such as object (an instance 
of a class), value (an instance of an at- 
tribute), and tuple (an instance of a rela- 
tionship), and lets clients instance the 
metainformation provided by MiMan, man- 
age instances and even serialize and de- 
serialize them from/to different formats 
such as a binary stream, XML, or plain 
text. Figure 2 is the example class model 
I use throughout this article; Figures 3 and 
4 show sample portions of in-memory 
structures managed by MiMan and GiMan 
and are available to Strix. 

A central Strix concept is that of system— 
a set of classes, relationships, and other con- 
structs that comprise a fully specified ap- 
plication domain. Figure 2 serves as an 
example. For the most relationally inclined, 
a system is for Strix as a database is for any 
conventional relational product. For Strix to 
operate with a system, it must be described 
by you using the Middle language, which 
offers the classic object-oriented stereotypes 
(class, relationship, generalization, attribute, 
and role), some not-so-classical ones (hi- 
erarchical enumerators, multivalued at- 
tributes, and relationships with multiple 
properties such as boundness or naviga- 
bility), and a rich set of data types (Booleans, 
integers, floating-point decimal, fixed-point 
decimal, strings, timestamps, timespans, and 
binary data). 

Listing One shows the Middle specifica- 
tion of the system in Figure 2. The system 
is named “Test,” and given the system ID 
Verdewek/Test. An enum declaration spec- 
ifies the possible values for attributes of 
type EProjectType, using the built-in hier- 
archical enumerator notation. Then, a list 
of class declarations follow. Attribute claus- 
es inside each class include the attribute 
name, its cardinality (0..1 if omitted; that is, 
a classical single-valued attribute that is nul- 
lable), its type (built-in or a declared enum), 
and in some cases a size (just for string and 
binary data types). Specializations and gen- 
eralizations are introduced by “isa” claus- 
es. The “reference” clauses, in turn, include 
the reference name, its cardinality (0..n if 
omitted), its target class optionally preceded 
by a role name in square brackets, an in- 
verse reference specification that links each 
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reference to its inverse one so both con- 
form a whole relationship, and some mod- 
ifiers such as bound or main. The bound 
modifier gives the reference bound se- 
mantics. (Bound semantics are often used 
to implement aggregations as opposed to 
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associations. However, Middle does not 
make this distinction, and bound seman- 
tics can be freely applied to references and 
combined with other properties.) Conse- 
quently, the lifetime of the referenced ob- 
ject becomes dependent on the referrer. 
The main modifier marks the reference as 
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more important than its inverse toward re- 
lationship identification. Finally, “invariant” 
clauses specify Boolean conditions that 
must always stay True for the object’s life- 
time. Invariants are not supported in the 
current version of Strix. 


Design Considerations 

Our main goals in constructing Strix was to 
make it simple to use, and suitable not only 
for large databases and systems (which in 
turn, usually imply bigger human teams and 
budgets), but also for small- and medium- 
sized projects. Also, we wanted to obtain 
an extensible and adaptable framework so 
it could be changed with little effort and 
customized by its final users. 

We made several decisions to meet these 
aims. First, we decided not to write a 
database system but to use existing ones, 
borrowing from their I/O, caching, and con- 
currency features. At the same time, we 
wanted to make Strix independent of the 
specific backend used to store data, so we 
took a modular approach. Common func- 
tionality regarding system installation and 
management, object-relational mapping, 
and GiObject construction and management 
is implemented by the SRT. On the other 
hand, product-specific functionality such as 
SQL generation, identifier management, and 
direct interaction with whatever library is 
used to access the database engine are im- 
plemented by store managers. A Microsoft 
Jet 4 store manager has been developed, 
and others will follow. The Strix run-time 
and the store managers communicate 
through a programmatic interface imple- 
mented by a separate component, the SRTS- 
MI, which is needed to isolate SRT’s “down- 
wards” interface to store managers from its 
“upwards” interface to applications; see Fig- 
ure 5. The SRTSMI, in fact, implements the 
Asio language, an abstraction of SQL that 
allows declarative communications between 
SRT and store managers. 

Another important decision was that of 
using a ubiquitous infrastructure to sup- 
port nearly all operations performed on 
persistent objects—the GiObject. Of 
course, using GiObjects adds another lay- 
er to the already thick pile of memory 
structures, COM, ActiveX, and Visual Ba- 
sic (or whatever language you use), but it 
also brings in some advantages. The main 
one is that metainformation can be made 
dynamic and changed at run time. C++ or 
Visual Basic objects, for instance, have to 
be based on classes defined at design time; 
GiObjects don’t. In coherence with this, 
classes, attributes, and relationships (ob- 
jects on their own) could be freely creat- 
ed, modified, and deleted at run time, al- 
though the current version of Strix does 
not support it. In fact, we found this func- 
tionality quite challenging but of little con- 
venience in usual business scenarios. 
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Listen to your Heart. 


“Don’t follow the crowd if you want to be a successful application developer. 

When we were asked to come up with a mobile-gaming platform for a 
client, our chief bit-fiddler wanted something from one of the big dinosaurs. 
But my heart was screaming FREE-DOM! 

| slammed my fist on the table and shouted: ‘Boss, we need Ellipsus. 

Haven’t heard about Ellipsus? Well, they’ve developed infiniteMASS"™’, 
the industry's best Mobile Application Server Suite. It’s so good you'll 
never have to worry about platforms, protocols, and devices again. 

The project was a snap. So, if you care about your career, go straight to 
www.ellipsus.com. Download the software and show the boss. The benefits 
are massive, maybe even infinite.” 





Michael McCartney, 
Filthy Rich Developer 
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Finally, some extensibility mechanisms 
had to be built in Strix so that you could 
hook in its internal workings. The SRTSMI 
component defines an abstract interface that 
user-defined libraries can implement to gain 
access to some Strix internals. Using this 
feature, Strix triggers events as selected op- 
erations are performed, providing a sim- 
ple way to monitor, trace, or intercept them. 


Implementation 

Strix was implemented on Windows using 
Microsoft’s Visual Basic 6 and Visual C++ 
6. The SRT, SRTSMI, and store managers 
are implemented as ActiveX DLLs. An ap- 
plication wanting to use Strix would import 
the SRT type library through Project Refer- 
ences in Visual Basic or create a class wrap- 
per in Visual C++ and directly call its meth- 
ods; in this sense, using Strix is comparable 
to using Microsoft ActiveX Data Objects 
(ADO) or any similar database library. 

Different store managers may coexist 
in the same computer, and each of them 
must be registered in the Windows Reg- 
istry so the SRT can find them as needed. 
Store manager registration must be per- 
formed by setup programs following the 
guidelines specified in the Strix Store Man- 
ager Development Kit (available soon at 
http://www.verdewek.com/). 

At the database level, Strix stores ob- 
jects in tables following some hard-wired 
conventions that are completed with a 
small amount of heuristics done at system 
installation time. The mapping between 
the object-oriented and the relational 
worlds is done as follows: 


e A table is created for each class in the 
system. Each row in the table corre- 
sponds to an object. An Old column is 
added to hold object identifiers. 

e Another column is created for each 
single-valued owned attribute (those with 





a maximum cardinality of 1), mapping Mid- 
dle data types into backend-specific ones. 
Each multivalued attribute (those with 
maximum cardinality greater than 1) is 
implemented in one of three possible 
ways, depending on its maximum cardi- 
nality and some other parameters: 1. For 
small maximum cardinalities, separate 
columns in the same table may be used 
to store different values of the attribute. 
2. For larger maximum cardinalities or 
other scenarios, a single column in the 
same table can be used to store all the 
values for the attribute, encoded in a sin- 
gle string. 3. Finally, an additional table 
may be created and related one-to-many 
with the main table; each row will con- 
tain a single value for the attribute and a 
link to the related row in the main table. 
Tables corresponding to classes include 
columns for owned attributes; that is, 
attributes defined by themselves and not 
inherited from superclasses. This way, 
specializations are implemented dis- 
tributing object data across several ta- 
bles and allowing abstract views of ob- 
jects by looking at each table in isolation. 
Figure 6 shows an example of this. 
References between classes may be im- 
plemented in one of two possible ways: 
1. For one-to-one or many-to-one ref- 
erences, an additional column may be 
added to the class table and linked to 
the related class table. Navigational 
queries can be resolved easily, but in- 
verse ones must be performed by 
lookup. 2. For one-to-many or many- 
to-many references, an additional table 
is created with two columns, which are 
linked to the related class tables. 
Hierarchical enumerators are stored in 
a specific table and enumerator items 
are stored in another table. Items are 
assigned encoded string identifiers so 
that abstract comparison can be per- 
formed without hierarchical navigation. 


Figure 4: Generic information managed by GiMan. 
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Unique indexes are created for Old 
columns and nonunique indexes may be 
created for columns involved in relation- 
ships. In any case, these conventions may 
vary in future versions of Strix as the in- 
stallation engine becomes more sophisti- 
cated. In particular, hints given by the de- 
veloper will be considered in a future 
revision. 


Strix Dynamics and Use 

To use Strix, you must write a text file con- 
taining the Middle specification of the sys- 
tem to implement, as well as decide on a 
database backend to use. The system can 
then be installed into Strix by calling the 
StrixkunTime. InstallSystem operation, and 
passing as arguments the Middle filename, 
an identifier of the chosen backend, and any 
backend-specific parameters. If a suitable 
store manager is available, Strix compiles 
the Middle specification and, if no errors are 
found, stores system metainformation for 
future use. Also, it loads the appropriate 
store manager and directs it to create the 
database structure that serves as data store. 
Once this is done, system installation is con- 
sidered complete and the system can be used 
normally. Figure 7 is a sample system as 
shown by the MiMan Explorer tool. 

To use an installed system, you must 
open it by calling StrixkunTime.OpenSys- 
tem and pass the desired system’s name 
as an argument. This operation returns a 
pointer to a System object, which ee 
the following services: 


e Metainformation 
e CreateObject 

e RetrieveObject 

© Query 

e SaveObject 
DeleteObject 

¢ CloseSystem 


The Metainformation property returns 
a pointer to a MlPackage object, imple- 
mented by MiMan, which lets you navigate 
the system’s metainformation and examine 
its classes, attributes, relationships, and 


enums. 
The sceinte aia lets you create a new 





Figure S isolating interfaces in thie 
Strix run time. 
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be passed as argument. The SRT relies on 
GiMan to create a GiObject object, and 
then attaches it to the specified class, gives 
it a unique object ID (Old), calls the store 
manager to insert it into the database, and 
returns a pointer to the GiObject to the 
caller. The GiObject, in turn, offers col- 
lections of values and tuples correspond- 
ing to its class’s attributes and references, 
as well as the option to jump into the 
metainformation realm from any instance- 
level entity (for example, from a value to 
the linked attribute, or from an object to 
the correspondent class). Also, a GiOb- 
ject can be serialized into a byte stream 
in any of several formats. Currently sup- 
ported formats include the proprietary Se- 
rial Object Format (SOF), XML (a simple 
DTD is used), and human-readable text. 

RetrieveObject takes a single Old as ar- 
gument and returns a pointer to the re- 
trieved GiObject. Once again, it relies on 
GiMan to create a GiObject, then invokes 
the store manager to query the underly- 
ing database and populate the GiObject 
with the proper values and tuples. 

The Query operation also retrieves ob- 
jects from the database but lets you sub- 
mit a declarative query written in the Owl 
query language. On return, the operation 
will give the caller a pointer to a GiOb- 
jectCol object, which holds a collection of 
zero or more GlObjects. The Owl language 
closely resembles SQL but navigational ex- 
pressions give it much more power. For 
example, the query: 


SELECT Employee e WHERE e.WorksIn. 
Requester.Name LIKE 'a*' 


retrieves all employees working in projects 
that have been requested by customers with 
a name starting with the letter “a.” Rela- 
tionships can be navigated by relationship 
name or role name. Owl does not support 


(wi: SELECT Employee € 
WHERE OIP(e} = 10015; 








joins in the SQL sense, but it does support 
navigational expressions, object compari- 
son, abstract comparison (antisymmetric 
version of conventional comparison that 
yields Car = Vehicle but Vehicle <> Car, 
useful with hierarchical enumerators), and 
collection membership test. 

Once an object is created or retrieved, 
the programmer can modify it in a num- 
ber of ways: adding values for its attributes, 
editing or deleting existent values, adding 
new tuples for its references, or removing 
existent tuples. These changes are accom- 
plished through GiMan’s classes and op- 
erations. To make changes persistent, the 
SaveObject operation must be called, pass- 
ing the edited GiObject as an argument. 

Deleting an object is as simple as call- 
ing DeleteObject and passing an Old as ar- 
gument. Finally, CloseSystem closes the 
system and releases resources. 


Extending Strix 
Strix can be extended at three major levels: 


e Developing new store managers so it 
can use alternative backends. 

¢ Monitoring information exchange at the 
SRTSMI. 

e Using higher level layers on top of Strix. 


To develop a new store manager, you 
must familiarize yourself with the two par- 
ties involved: the SRT and the specific back- 
end to use. Interfacing with the SRT is done 
through the SRTSMI, so we have prepared 
a Strix Store Manager Development Kit 
(SSMDK) that includes reference materials, 


practical recommendations, and a proposed 


architecture for new store managers. Inter- 
facing with the specific backend is usually 
done by issuing SQL sentences to the 
database server through any available library. 
For example, the Microsoft Jet 4 store man- 
ager uses the ActiveX Data Objects (ADO) 


SQL: SELECT Person.OId, Person.Name, Person.Address, Employee.Wage, Employee .Job 
FROM Person INNER JOIN Employee ON Person.OTd = Employee .OId 
WHERE Person.OId = 16015; 


Owl: SELECT Person p 
. RHERE p.Address LIKE ‘“%strt’ ; 


SQL: SELECT Person.GId, Person.Name, Person .Address 


PROM Person 
WHERE Address LIKE ‘tstrc%’; 





Figure 6: Distributing object data across tables. 
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2.5 library. The main purpose of a store 
manager, actually, is to translate from Asio 
commands sent by the SRT into SQL sen- 
tences or other database manipulation com- 
mands. An Asio interpreter is therefore a 
central component of every store manager. 
The SRTSMI provides a number of services 
to help you in dealing with Asio sentences. 

Monitoring information flow inside Strix 
is another interesting extension. The SRT- 
SMI defines the [StrixExtension interface, 
which can be implemented by any COM 
object wanting to receive events from 
SRTSMI. Fhis COM object should register 
itself with SRTSMI, specifying which kinds 
of events, store managers, and systems are 
of interest, so it receives an event each 
time an Asio command is sent to a store 
manager or some other operations are per- 
formed. The StrixTrace tool uses this fea- 
ture to show a trace of all information 
passing through Strix. 

Finally, using wrapper components to 
access Strix instead of directly interfacing 
to it can be useful in many cases. The Scops 
component (see Figure 1) offers language- 
specific class wrappers for the classes de- 
fined in the system, so a more convenient 
programmatic interface can be used. List- 
ings Two and Three are Visual Basic code 
for retrieving a list of all projects request- 
ed by a specific customer using raw Strix 
and using Scops. With Scops, code is short- 
er and more readable, and strong typing 
at the application-domain level avoids many 
run-time type checks. 

In addition, the Ulula component (Fig- 
ure 1) implements an emergent-user in- 
terface engine, so user-interface elements 
(such as forms and reports) can be dy- 
namically built at run time or installation- 
time instead of having to be hardcoded 
by hand. Ulula uses metainformation to 
set up forms and fields, pick the most 
appropriate user-interface control (text 
boxes, drop-down lists, check boxes, and 


es 








Figure 7: Sample system shown by the 
MiMan Explorer. 
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sO On) to edit each attribute value, and of- 
fer relating, grouping, and sorting options 
when building a report definition. Adding 
an attribute to a class or changing the class 
model itself would not require manually 
editing the application front end because 
Ulula will change the user interface ac- 
cordingly the next time it is invoked. Of 


course, a performance penalty exists, as 
working versions of Ulula demonstrate, 
but today’s powerful computers are much 
cheaper than human workforce. 
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Listing Two 
‘retrieve target customer. 


Dim obCustomer As GiObject 
[retrieve it in any way you choose] 


Listing One 


system Test {Verdewek/Test} 
enum EProjectType 


InformationSystem 
ip apa "prepare collection to hold projects for the customer. 
ActiveXControl Dim obsProjects As Gi0bjectCol 
f Set obsProjects = 
WebSite obCustomer . Package. CreateObjectCol ("Project") 
end Dim obProject As GiObject 


class Person abstract 
attribute Name: 1 String 40 
attribute Address: String 200 
end 
class Customer 
isa Person 
attribute Code: 1 String 8 
reference Requests: 1..n Project inverse RequestedBy bound 
end 
class Employee 


‘iterate tuples of target customer. 
Dim tu As GiTuple 
For Each tu In obCustomer.Tuples 
‘if tuple belongs to the Requests reference... 
If tu.Reference.Name = "Requests" Then 
"retrieve related project. 
Set obProject = sy.RetrieveObject (tu.TargetOId) 
‘and add it to the collection. 
obsProjects.Add obProject 


isa Person End If 
attribute Wage: Fixed Next tu 
attribute Job: String 40 eae 

reference WorksIn: Project inverse HasTeam main Listing Three 


end ; ‘retrieve target customer. 
class Project : Dim cus As Customer 
attribute Name: 1 String 40 [retrieve it in any way you choose] 
attribute Type: 1..n EProjectType 
attribute DateDue: TimeStamp 
attribute Finished: Boolean 
reference RequestedBy: 1 [Requester] Customer inverse 
Requests main 
reference HasTeam: [TeamMembers] Employee inverse WorksIn 
invariant FinishedInPast: Not Finished Or (Finished And DateDue< Now()) 
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"prepare collection to hold projects for the customer. 
Dim pjs As Projects 


"retrieve project list. 
Set pjs = cus.Requests 
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A JDBC Wrapper— 
In Python! 





A framework for 
simplifying database 
access 





C.K. Tan 


hile generally straightforward, 
database access via JOBC can be 
verbose and mundane, making 
the likelihood of errors always a 
concern. To simplify the process and 
lessen the chance for errors, I'll present in 
this article a Python-based framework that 
wraps around JDBC. This framework 
(available electronically; see “Resource 
Center,” page 5) is based upon a con- 
nection pool, transaction class, and JOBC 
code generator. For the purpose of illus- 
tration, I use an Oracle database with the 
table EMPLOYEE(id number, name var- 
char(50), salary number(9, 2), hired_date 
timestamp). 

Listing One presents Insert/Update/Se- 
lect/Delete queries on the EMPLOYEE 
table using regular JDBC calls. Contrast 
this to Listing Two, in which I use the 
Python-based framework for the same 
queries. As you can see, the framework 
results in considerable code savings, not 





C.K. can be contacted at cktan@ 
uwalumni.com. 
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to mention improved type checking. The 
execute() method of update queries and 
the getOneRow() method of select queries 
takes a typed argument list. The frame- 
work also does away with the getX(col- 
umn#)/setX(column#) methods in the 
base JDBC interface. These methods are 





the usual bug originators when modifica- 
tions to existing queries neglect updates 
of the column number constants in the 
getX/setX methods. The framework also 
provides facilities to instrument the queries 
for performance measurement, and helps 
in organizing source code for both de- 
velopers and DBAs. 


Connection Pool 


The DbConnection class represents a 
database connection. Each DbConnection 
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instance contains a java.sql.Connection 
object and a hashtable cache of prepared 
statements. A public static init() method 
must be called to initialize the framework. 
This method sets up a DbConnection pool, 
which is basically a stack of DbConnec- 
tions. The init() method takes as argu- 
ments the number of connections to put 
in the pool, the database user ID, database 
user password, JDBC connection URL, and 
the JDBC driver name. Listing Three shows 
how I connect to an Oracle database and 
an IBM DB2 database. 


DbTrans 

The DbTrans class represents a database 
transaction. To start a new transaction (as 
in Listing Two), I simply do a new Db- 
Trans(). To complete the transaction, I 
call the close) method, passing it the com- 
mit or abort flag. It is important that each 
transaction gets closed because each new 
Transaction instance checks out a Db- 
Connection from the connection pool just 
discussed. The close() releases the Db- 
Connection back into the pool. If the 
close() method is not invoked, I eventu- 
ally run out of DbConnections in the pool. 
That is why I always use the Transaction 
class in the iry...finally clauses, as in List- 
ing Two. 


The Python Scripts 

The mksel.py script generates SQL Select 
JDBC calls, while the mkupd.py script gen- 
erates the SQL Insert/Update/Delete JOBC 
calls. The Select queries are treated dif- 
ferently from Insert/Update/Delete queries 
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because Select queries return multivalue 
result sets vis-a-vis the single integer row 
count returned by the Insert/Update/ 
Delete queries. 

Each query is specified in a Python tu- 
ple: Gava-class-name, augmented-sq]). 
The java-class-name states the desired 
Java class of the query. The augmented- 
sql contains the SQL statement augment- 
ed with Java hints. A little parser works in- 
side the scripts to parse the augmented-sql 
queries. Mainly, it looks for the pattern 
[x:y], where x and y are strings enclosed 
in the square brackets and separated by 
the colon character. The x token is in- 
terpreted as a SQL expression. The y to- 
ken is taken to be a Java type declara- 
tion. I have added a bytes type to mean 
bytel/. | have also handled the boolean 
type by mapping the value to the strings 
t or fin the database and back. 

For example, Listing Four shows the Se- 
lectEmployeeByld query. First of all, no- 
tice that I named the query SelectEm- 
ployeeByld. Also, each result row of the 
query contains the employee’s ID 
(matched to a Java long 'id'), the em- 
ployee’s name (matched to a Java String 
‘name'), the employee’s salary (matched 
to a Java float 'salary'), and employee’s 
hire date (matched to a Java Timestamp 
‘bireDate'). Finally, the parameter to the 
query is a Java long ‘id’. 

Listing Five is the InsertEmployee 
query. Likewise, I named the query Jn- 
sertEmployee, which takes as parameters 
a Java long ‘id’, a Java String ‘name’, a 
Java float 'salary', and a Java Timestamp 
‘hireDate'. 

Running the scripts result in the cre- 
ation of the files SelectEmployeeByld.java 
and InsertEmployee.java. You may want 
to read the generated code to get a deep- 
er understanding of how it all works. 


Instrumentation 

The Statistic class associates a name to a 
set of numbers representing the invoca- 
tion count, minimum execution time, max- 
imum execution time, and total execution 
time. Internally, the Statistic class saves all 
its instances in a static sorted tree map 


based on the name; this map facilitates a 
search on all statistics in the system by 
name. To keep track of new statistics, I 
simply instantiate a Statistic object by giv- 
ing it a name. The constructor of the Statis- 
tic class inserts the object into the Statis- 
tic map. From then on, I can add samples 
to the Statistic object by calling the addZ 
method, or reset the Statistic object by 
calling the clear() method. The Statistic 


The framework 
results in 
considerable code 
savings, not to 
mention improved 
type checking 





map is mainly used in finding instances 
of the Statistic object by name. This usu- 
ally occurs when I need to dump some 
statistics to the screen. 

The framework uses the Statistic class 
to instrument each database query. Be- 
cause each query has a unique Java path 
name, I use this name to instantiate a stat- 
ic Statistic object for each query. For each 
invocation of the query, I calculate the 
elapsed time for the query and add this 
sample to the Statistic object. I usually 
code a dump function to print out all 
query statistics from time to time or on 
demand. The printout is helpful during 
debugging and performance evaluation 
as it shows how many times each query 
is invoked and how long it takes. The 
printout often points immediately to a sub- 


par query that results in a full table scan 
in the database. 


Limitations 

There are, of course, limitations to the 
framework. When these times come, you 
can work around the problem by getting 
the raw JDBC Connection object from the 
DbTrans object and issue JOBC commands 
directly. An obvious limitation is the stored 
procedure. The framework does not sup- 
port stored procedures because I did not 
find a need for it. Another limitation is in 
streaming. This concerns the getXStream() 
methods of the java.sql.Statement class. 
JDBC stipulates that when using streams, 
the stream content must be emptied be- 
fore the next getX() method is invoked. 
I could not find a way to handle this grace- 
fully, and thus, have elected not to han- 
dle it at all. Yet another limitation is when 
the query has a variable number of pa- 
rameters. 


Conclusion 

I use Python primarily for its text pro- 
cessing capability and its readability. There 
is no question that the same thing could 
have been done with Perl or, for that mat- 
ter, Java. But the code would not look as 
clean as Python. 

A beneficial side effect that comes di- 
rectly from the use of the framework is the 
fact that all your queries are now collect- 
ed in the two Python scripts and not scat- 
tered all over; there are only two places to 
look for queries that have gone awry. 

Generating the JDBC codes have re- 
sulted in considerable increase in pro- 
ductivity in the projects I was involved in. 
The ease of generating SQL queries, and 
the associated benefits that come with it, 
allow developers to quickly complete the 
data-access layer for any new project, and 
do so with less bugs. 
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Listing One 


del = con.prepareStatement ( 
"delete from EMPLOYEE where id = ?"); 


PreparedStatement ins = null, upd = null, sel = null, del = null; 
Connection con = DriverManager.getConnection(url, "mylogin", "mypasswd") ; ins.setLong(1, 1000); 
ins.setString(2, "John Smith") ; 


boolean commit = false; 

try { 
Calendar c = Calendar.getInstance() ; 
c.clear(); 
c.set (2000, @, 1); 


ins = con.prepareStatement ( 


"insert into EMPLOYEE(id, name, salary, hire_date) " + 


" walues- (7, 75%... 2)*)3 

upd = con.prepareStatement ( 
"update EMPLOYEE set salary = ? " + 
"where id = ?"); 

sel = con.prepareStatement ( 
"select id, name, salary, hire_date " + 
"from EMPLOYEE " + 
"where id = ?"); 
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ins.setFloat(3, 100000); 


ins.setTimestamp(4, new Timestamp(c.getTime().getTime())); 


ins.executeUpdate() ; 


System.out.println("inserted 1000 John Smith 100000") ; 


upd.setFloat(1, 120000) ; 


upd.setLong(2, 100); 
upd. executeUpdate () ; 


System.out.println("updated 1000 120000") ; 


sel.setLong(1, 1000); 


ResultSet rs = sel.executeQuery(); 


try { 
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} 


if (rs.next()) { 
System.out.println("selected " + rs.getLong(1) +" " 
+ rs.getString(2) +" " 
+ rs.getFloat(3) +" " 
+ rs.getTimestamp(4)) ; 
} 
else { 
System.out.println("error: cannot find employee id 1000"); 


} 
finally { 

rs.close(); 
} 
del.setLong(1, 1009); 
del.executeUpdate() ; 
System.out.println("deleted 1000") ; 


commit = true; 


finally { 


} 


if (ins != null) 
ins.close(); 
if (sel != null) 
sel.close(); 
if (upd != null) 
upd.close() ; 
if (del != null) 
del.close(); 


if (commit) 
con.commit(); 

else 
con.rollback(); 

con.close(); 


Listing Two 


boolean commit = false; 
DbTrans trans = new DbTrans(); 
try { 
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Calendar c = Calendar.getInstance() ; 
c.clear(); 
c.set (2000, 0, 1); 
InsertEmployee.execute(trans, 1000, "John Smith", 100000, 
new Timestamp(c.getTime().getTime())); 
System.out.println("inserted 1000 John Smith 100000") ; 


UpdateEmployeeSalaryById.execute(trans, 120000, 1000); 
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} 


System.out.println("updated 1000 120000") ; 


SelectEmployeeById.Row row = SelectEmployeeById.getOneRow(trans, 1000); 
if (row != null) { 
System.out.println("selected "+ row.id + "" + row.name +" " 

+ row.salary + " " + row.hireDate) ; 
} 
else { 
System.out.println("error: cannot find employee id 1000"); 


DeleteEmployeeById.execute(trans, 1900); 
System.out.println("deleted 1000") ; 


commit = true; 


finally { 


trans.close(commit ? trans.COMMIT : trans.ROLLBACK); 


System.out.println("Statistics dump:"); 
Statistic. dump (System. out) ; 


e e 
Listing Three 
/*.for oracle database */ 
DbConnection.init(20, "devi", "devi", 


"jdbe:oracle:thin:@192.168.123.7:1521:SAMPLEDB", 
"oracle. jdbc.driver.OracleDriver") ; 


/* for db2 database */ 
DbConnection.init(20, "devi", "devi", "jdbe:db2:SAMPLEDB", 


"COM. ibm.db2.jdbc.app.DB2Driver") ; 


Listing Four 


( 


"SelectEmployeeById", '''\ 


select [id:long id], [name:String name], [salary:float salary], 


[hire_date:Timestamp hireDate] 
from EMPLOYEE 
where id = [?:long id]''') 


Listing Five 


( 


"InsertEmployee", '''\ 


insert into EMPLOYEE(id, name, salary, hire_date) 
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values ([?:long id], [?:String name], [?:float salary], 
[?:Timestamp hireDate])''') 
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AppForge: Visual Basic 








Rapid development for 
the Palm platfo 


Clayton E. Crooks II 





sing the Palm Software Developers 

Kit (SDK) can be a daunting expe- 

rience. Not only does the SDK re- 

quire you to know C programming, 
but it is challenging in terms of new API 
references and memory management. The 
AppForge development environment, on 
the other hand, can help out by leverag- 
ing Visual Basic’s ease of use. 

AppForge (http://www.appforge.com/) 
was developed as an add-on to the stan- 
dard Windows version of the Visual Basic 
6 Integrated Development Environment 
(IDE). AppForge is available in a profes- 
sional edition and a standard edition, which 
is downloadable for a free 30-day evalu- 
ation. AppForge for Visual Basic integrates 
directly into the Visual Basic IDE as an 
add-on and consists of three components: 


e A compiler and utilities for packaging 
and transferring compiled applications 
to devices. 

e Special ActiveX controls called “Ingots” 
that replace the standard VB intrinsic 
controls when a Palm application is be- 
ing developed. 

e The Booster run-time engine, which is 
freely available to anyone who wants to 
run an AppForge application on their de- 
vice. The AppForge Booster replaces the 
standard VB run-time DLL and consists of 
native code that runs AppForge applica- 
tions on a Palm OS-based mobile device. 


As Figure 1 illustrates, an AppForge 
program is similar to any application when 


Clayton is an independent developer and 
can be contacted at crooks@planetc.com. 
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being developed in the VB IDE— except 
for the extra controls in the Toolbox (the 
Ingots), which are the size of the form 
that is created by default. At 160x160 pix- 
els, the size is the same as a standard Palm 
Pilot screen. 


Creating an Application 

To illustrate AppForge development, I’ll 
present a program that uses a variety of 
AppForge features, including a database 
and several AppForge Ingots. Once com- 
pleted, the program lets you track basic 
information about job times and signa- 
tures for projects that you are working on. 
You can walk through the development 





of this project using the 30-day trial ver- 
sion of AppForge. 

The only limitation to the functional- 
ity is the 30-day time limit. The complete 
source code and related files are avail- 
able electronically; see “Resource Cen- 
ter,” page 5. 

Once you've installed AppForge, start 
Visual Basic and from the Projects Type 
window select AppForge Project. You will 
be presented with a toolbox that has the 
AppForge Ingots already available along 
with the standard VB intrinsic controls. 
The standard VB controls cannot be used 
in a Palm project, but are displayed in 


Dr, Dobb's Journal, August 2001 


for the Palm OS 


the toolbox. As a reminder, if you try to 
place a standard control on the form, you 
are warned by the IDE that you cannot 
use it. Along with the toolbox, a form is 
displayed and ready for you to begin 
working. 


The GUI 

Like most VB applications, the first step 
in the project is to create the basic lay- 
out of a GUI. You can begin by placing 
several AppForge Label Ingots on the 
160x160 form that was created when you 
began the project. When you place the 
first Ingot on the form, you are asked 
to save the project before continuing. 
Save the project in an empty directory 
of your choice. The directory will also 
be used later to store the database files. 
The project needs five Labels with the 
captions Record, Date, Time, Project 
Name, and Signature. An additional La- 
bel needs to be created to store the 
record information, and as such, should 
have its caption changed to a blank 
space. The GUI should look something 
like Figure 2. 

The next step is to create three Text 
Box Ingots and set their properties as 
ixtName, txtDate, and txtTime. These In- 
gots display information contained in a 
database and should have their captions 
set to empty space. 

One special AppForge Ingot is the Sig- 
nature Ingot. As its name suggests, it’s 
used for storing information the user 
draws or writes on the screen. This ap- 
plication uses one of these for a user sig- 
nature. Place a single Signature Ingot on 
the form. 

Lastly, you need to create six Button 
Ingots that perform a variety of input op- 
tions. Create the buttons with the prop- 
erties in Table 1. The final GUI should 
look like Figure 3. 


Database 

The next step is the development of a 
database on the Windows desktop. Using 
Access 2000, create a database with the 
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fields in Table 2. After the Access database 
is completed, save it using the name Sig- 
Time.mdb. Next, use the AppForge 
Database Converter (installed with App- 
Forge) to convert the Access MDB 
database to a Palm database (PDB). The 
converter generates the Palm database 
along with a BAS file that you can use in 
the VB project. The BAS file contains the 
entire basic framework for accessing and 
using the Palm database on the desktop 
or Palm device, and should be saved to 
the directory you selected for the project 
in an earlier step. 


Creating the Code 

With the database and BAS files created 
Figure 1: The AppForge IDE. and copied to the same directory, begin 
working on the rest of the program. The 
first thing to do is to add the BAS file, 
which was generated by the AppForge 
Database Converter, to the VB project. 
This can be via the Project | Add Module 
in the Visual Basic menu. The module con- 
tains all the necessary code for directly 
accessing the Palm database, leaving you 
responsible for adding the appropriate 
code in the Visual Basic events. 

When the AppForge Button Ingots are 
clicked, an event is raised as with any VB 
control. Using these events, it’s easy to 
add records, navigate the existing records, 
or delete records. Additionally, depending 
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Figure 2: Creating the Palm GUI 
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Less effort. Fewer bugs. The first time. 
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e e 
Listing One 
Private NewRecord As Boolean 
Private Sub btnBack_Click() 
PDBMovePrev dbSigsTime 
DisplayInfo 
End Sub 


Private Sub btnExit_Click() 
Unload Me 
End Sub 


Private Sub btnNext_Click() 
PDBMoveNext dbSigsTime 
DisplayInfo 

End Sub 


Private Sub btnDelete_Click() 
If NewRecord Then 
ClearDisplay 
NewRecord = False 
DisplayInfo 
ElseIf PDBNumRecords(dbSigsTime) > @ Then 


SYSTEMS 


DEVELOPERS 


Santa Fe, New Mexico 


Prediction Company is a small firm 
utilizing the latest forecasting and 
systems technologies for prediction and 
automated trading of _ financial 
instruments. We have a successful track 
record and the backing of a large 
international financial institution. 


Our offices are located in beautiful 
Santa Fe, New Mexico, a city known for 
its multi-cultural heritage, artistic scene 
and outdoor lifestyle. Biking, hiking, 
camping, rock climbing and skiing are 
all possible in the immediate area. 


The successful candidates for these 
positions have at least a BS degree in 
computer science or other technical 
discipline, broad experience in 
specifying, designing and implementing 
software systems and_ significant 
experience with object-oriented lan- 
guages and systems. The development 
environment is Unix, C++ and S. 


Highly competitive compensation 
package with medical benefits fully 
paid. Annual bonus. 401(k). 30 paid 
days off a year. Paid relocation. 
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Table 1: Creating buttons and their — 


website: www.predict.com 


email: hr-recruit@predict.com 








_ (continued from page 58) 


on the button, you assign database infor- 
mation to the appropriate AppForge In- 
got that is on the form. Listing One is all 
you need for this project. 


Testing the Application 

Perhaps the best feature that AppForge 
provides is the ability to test the applica- 
tion from within the VB IDE without the 





need of an emulator or device. You can 
simply run it like any VB application and 
test the program functions. If you would 
prefer to download the application to a 
device or the Palm OS emulator (http:// 
www.palm.com/), you first need to com- 
pile it and then copy both the Booster run 
time and the Palm executable. 


Conclusion , 

With the popularity of the Palm exceeding 
that of Windows CE and other PDA OSs, 
it’s important to have a variety of devel- 
opment tools to choose from. AppForge 
for Visual Basic offers everything you need. 
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PDBDeleteRecord dbSigsTime 
DisplayInfo 
Else 


MsgBox "No records to delete" 


End If 
End Sub 


Private Sub btnNew_Click() 
NewRecord = True 


1lblRecordDisplay.Caption 


txtDate.Text = '"" 
txtTime.Text = "" 
ClearDisplay 

End Sub 


Private Sub btnSave_Click() 


= "New" 


Dim MyRecord As tSigsTimeRecord 


If NewRecord Then 


PDBCreateRecordBySchema dbSigsTime 


End If 


DisplayInfo 
End Sub 


Private Sub Form_Load() 


#I£ APPFORGE Then 


#Else 


#End If 


End If 


DisplayInfo 
End Sub 


Private Sub DisplayInfo() 


NewRecord = False 


Mail cover letter and resume to: Else 


PREDICTION COMPANY 
236 Montezuma Avenue 
Santa Fe, NM 87501 


)) PREDICTION 


COMPANY End Sub 


NewRecord = True 


ClearDisplay 
End If 
End Sub 


Private Sub ClearDisplay() 
sigName.Clear 
txtName.Text = '"" 
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PDBEditRecord dbSigsTime 
WriteSigsTimeRecord MyRecord 
PDBUpdateRecord dbSigsTime 


If OpenSigsTimeDatabase 


PDBMoveFirst dbSigsTime 


MyRecord.Name = txtName.Text 
MyRecord.Signature = sigName.SignatureData 
MyRecord.Date = txtDate.Text 
MyRecord.Time = txtTime.Text 


= False Then 


dbSigsTime = PDBCreateDatabase("SigsTime", 


SigsTime_TypeID, SigsTime_CreatorID) 


dbSigsTime = PDBCreateDatabase(App.Path & "\SigsTime", 


SigsTime_TypeID, SigsTime_CreatorID) 


PDBCreateTable dbSigsTime, "Signatures", 


"Name String, Signature String" 


Dim MyRecord As tSigsTimeRecord 

If PDBNumRecords(dbSigsTime) > @ Then 
ReadSigsTimeRecord MyRecord 
txtName.Text = MyRecord.Name 
sigName.SignatureData = MyRecord.Signature 
txtDate.Text = MyRecord.Date 


txtTime.Text = MyRecord.Time 


lblRecordDisplay.Caption = CStr(PDBCurrentIndex(dbSigsTime) + 1) + 


"of " + CStr(PDBNumRecords (dbSigsTime) ) 


lblRecordDisplay.Caption = "@ of @" 
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FOR JSAVA 


Formula One for Java automates 
and Web-enables the most common 
spreadsheet-based reporting and 
charting tasks in financial and cal- 
culation-intensive industries: 


e Embed powerful reporting and chart- 
ing functionality in JSP servlets, and 
server-based Java applications. 


Dynamically generate and distribute 
richly formatted Excel reports over 
the Web with formulas, charts, for- 
matting, outlining, and many other 
interactive features as opposed to 
simple, bland CSV or tab delimited 
files. 


Provide a robust spreadsheet inter- 
face for presenting and manipulating 
data in applets deployed to browsers 
or Java desktop applications — with 
no reliance on Excel or Windows. 


Perform scalable calculations on 
servers by leveraging spreadsheet 
functions and formulas. No need to 
write your own algorithms or calcula- 
tions. 


Deliver interactive Web charts with 
Clickable areas for "drilling down" 
into data. 


Utilize existing Excel spreadsheets as 
templates for reports and charts 
instead of hard-coding reports in 
Java. 


Access XML, databases, text files, 
and other server-side data sources. 


Read, edit, and manipulate existing 
Excel spreadsheets in Java. 
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Try Formula One for Java in your 
next reporting application 

for 30 days. Visit our site for live 
demos, sample code, and 
download at www.tidestone.com. 





EMBEDDED SYSTEMS 


Object-Oriented Device 
Networking 








Building complex 
systems using simple 
devices 





Michael Howard 


evice networking is the focus of many 

current development efforts, with ini- 

tial efforts focusing on remotely mani- 

pulating and monitoring newly con- 
nected devices. This is appropriate when 
the intent is to put users directly in control 
of the device. However, the problem is that 
billions of these new devices are built ev- 
ery year and many of them need to be part 
of larger intelligent systems. Clearly, archi- 
tectures for device networking must pro- 
vide a framework for creating complex sys- 
tems from simple, standardized building 
blocks — and object-oriented design is the 
key to making this happen. In this article, 
I'll present an object-oriented approach that 
is an alternative to the traditional ways of 
designing networked devices. 


The Opportunity 

When I started working in the device- 
networking field four years ago, the task 
at hand seemed straightforward. We want- 
ed to leverage the massive amount of 
Internet-related software R&D by making 
systems that connected web browsers to 
embedded devices. These devices were 
often built around simple microcontrollers 
in extremely cost-sensitive applications. 
Even though the microcontrollers were 


Michael is chief technology officer for 


emWare. He can be contacted at mhoward@ 
emware.com. 
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low cost, the value in being able to mon- 
itor and manipulate the devices was of- 
ten surprisingly high. 

We identified several areas where net- 
working added value to previously isolated 
embedded systems. These areas included: 


e Inexpensive user interfaces. A commu- 
nications port can link a device appli- 
cation to a browser-based interface. This 
interface is significantly cheaper than an 
LCD screen, with its many buttons and 
user manuals. 


e Remote access for control and moni- 
toring. With each new design that in- 
corporates a microcontroller, there needs 
to be access to information stored in the 
microcontroller. 

e Integration of web-based information 
sources with device applications. There 
are many instances in which devices can 
benefit from having access to informa- 
tion from the outside world. Living in a 
desert, my favorite example is having a 
sprinkler controller use weather forecast 
information to avoid*watering before or 
during rainfalls. 
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e Distributed control systems. Many appli- 
cations of device communications involve 
making several computer systems act like 
one. This type of application is often mo- 
tivated by the desire to reduce wiring 
costs to a central controller. Another ex- 
ample is when a single system, say a 
complex commercial air-conditioning sys- 
tem, must be built from a set of unknown 
components. | 


Initial Efforts 

To address this market, we created a 
device-resident server that makes a set of 
functions, variables, and events available 
for remote manipulation via web-based 
clients. In effect, each device is encapsu- 
lated as a single device object in a web- 
based proxy server. The device object can 
be manipulated from remote interfaces 
using C, Java, or ActiveX APIs. The device 
object had a complete description of each 
function, variable, or event, but there was 
no attempt to create device types. 

Initial customer interest came from ser- 
vice applications in which technicians 
walked up to a piece of equipment and 
used our software and a Java service ap- 
plet to diagnose a complex system. Web 
technology made this type of application®™ 
really cool because when the local tech- 
nician had problems with a device, our 
software made the same diagnostic infor- 
mation available to an expert service tech- 
nician located at the corporate office. The 
expert was available online to help the lo- 
cal technician with particularly difficult 
maintenance and service procedures. 


Problems Start Coming Up 

Since using web technology to connect 

people to networked devices seemed rea- 

sonable, I patted myself on my back and 

believed — for a while, anyway— that I 

understood the problem space. 
(continued on page 66) 
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(continued from page 63) 

As we moved forward to extend the 
Internet to microcontroller-based devices, 
however, I realized how massively out- 
numbered we are by the machines 
around us and couldn’t imagine interact- 
ing with so many devices through indi- 
vidual UIs. The last thing I want is to have 
to pay attention to more details, so a web 
interface for each microcontroller-based 
device in my house seemed less and less 
like a good idea. 

Other problems that arose with this 
paradigm included: 


e Diversity of client devices and time to 
custom code and test UIs. 

e Although there are many kinds of de- 
vices, device security and management 
functions should be similar— but there 
is no consistent method for encapsulat- 
ing common functions. In fact, many 
types of devices fall into general cate- 
gories; therefore, the notion of object 
interfaces needs to be supported. 

e Most intelligent devices have static be- 
havior. Imagine coding thousands of 
specific requests for information into de- 
vices, then parsing the responses and 
acting on them. Now add the restriction 
that whatever the device does, it will 
continue to do even if the information 
sources change. 


Although our initial development efforts 
solved the connectivity problem, the lack 
of typing and any sort of behavior con- 
tract created serious problems. These 
problems made it difficult to build larger 
systems. 

Currently, I’m focusing on automation. 
Consequently, what I’d like to have is a 
device network infrastructure that is au- 
tomated in terms of configuration, main- 
tenance, and operation. The result should 
be that any information on a device is 
accessible by network resident applica- 
tions through standard database APIs. 
The database APIs are also used to af- 
fect the state of many devices, not just 
a single one or a single type. The net- 
worked devices are also logically con- 
nected to each other. When one device 
has information relevant to the opera- 
tion of another, this information is trans- 
ferred according to predefined rules for 
interaction. 

The problem space for this approach is 
divided into problems of representation 
and coordination. I want to maintain a 
network-based representation of a device. 
This representation is simply a replicated 
copy. It reflects the state of the device. 
When I manipulate the copy, the device 
state changes as well. When I talk about 
“device coordination,” I’m referring to the 
creation of distributed systems. I call it 
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“coordination” because the systems can 
be loosely coupled and one device is con- 
trolling the state of another device. For 
example, because a telephone possesses 
intelligence, it can send messages in- 
structing stereos or TVs to turn down their 
volumes when the phone is ringing, there- 
by eliminating missed calls. 


The Traditional Approach 
Manufacturers have developed thousands 
of devices with connectivity in mind, and 
most have incorporated RS232 ports with 
custom protocols. Although these pro- 
prietary protocols may implement com- 
mands for diagnostics, data logging, and 
device control intended for use by other 
devices, their level of interoperability with 
disparate networks is limited. Of course, 
the benefit of this approach is that it is 
simple and relatively inexpensive. How- 
ever, the problem with devices that uti- 
lize this limited form of connectivity is 
that they don’t incorporate the concept 
of Design by Contract (for more infor- 
mation on DbC, see http://www.elj 
.com/eiffel/bm/ot-dbc/). Thus, these 
devices and systems are not scalable, and 
cannot integrate higher level systems. 
Someone has to look at each and every 
device type and create a translation mod- 
ule. This is not a simple process because 
of the lack of protocol standardization 
and abstraction. 


The Object-Oriented Approach 

With this in mind, we are pushing to cre- 
ate uniform interfaces to object repre- 
sentations of devices and use standard 
class definitions to describe similar de- 
vices. By doing this, the task of creating 
useful software applications that incor- 
porate device information can be de- 
coupled from the task of getting that in- 
formation from the device. Additionally, 
the device descriptions represent a be- 
havioral contract, which is a prerequisite 
to any real progress in the automation 
of device network configuration and 
management. 

Our current effort starts with a system 
that lets you encapsulate a device into 
one or more objects. Each of these ob- 
jects is essentially an instantiation of a 
predefined device-related class. Typical 
implementations include class imple- 
mentations for management, general de- 
vice application, and a subclass for pro- 
prietary extensions. The management 
class implements features used for 
firmware upgrades, capabilities discov- 
ery, device identification, and performance- 
related tuning. The class that implements 
general device application is essentially 
the feature set of the virtual base class 
for the specific type of device. For ex- 
ample, a networked furnace would have 
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a basic set of properties used for main- 
tenance, diagnostics, and control. This 


class definition would be one defined by 


and agreed upon by a standards group 
such as the American Home Appliance 
Manufacturers (http://www.aham.org/), 
whose charter is to promote and define 
standards that allow interoperability of 
networked appliances. 


Interfaces 
At first glance, the encapsulation of an em- 
bedded device application into an object 
appears trivial. The need for industry-wide 
device specification is apparent when you 
look at the support that Universal Plug- 
and-Play (http://www.upnp.org/) is get- 
ting. Given a way to let the device de- 
scribe which classes and interfaces are 
implemented, and a protocol to allow ma- 
nipulation of the device objects, the de- 
sired automation should be within reach. 
But problems start creeping from sev- 
eral different factors: 


e A device manufacturer adds product fea- 
tures to differentiate their product from 
other manufacturers. If these features 
are to be available to other devices, then 
a combinatorial explosion occurs, which 
removes the usefulness of the objects 
for software that relies on uniformity. A 
class definition based on the feature set 
of the lowest common denominator al- 
lows the notion of polymorphism at a 
system level. An interface can be writ- 
ten to this object. The question remains 
whether treating all devices the same 
offers enough value from interoperability 
to prevent manufacturers from ignoring 
standardization efforts. 

e Different aspects of a device’s opera- 
tion will be exposed as different ob- 
jects or interfaces within an object. 
However, each of these is accessed 
through a communication link that is 
often bandwidth limited. This brings up 
a host of quality of service issues, in- 
cluding questions of timing: Do the 
class definitions need to specify re- 
quired responsiveness of the objects be- 





Figure 1: Connecting objects. 
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ing used? Will spurious delays break 
some applications? 


Inheritance 

Inheritance in the device-networking realm 
is an interesting feature. The implemen- 
tation of inheritance that I need is a dis- 
tributed inheritance. The base class of a 
device object is implemented on a larger 
device gateway program; the subclass is 
implemented on the product. The gate- 
way device then further subclasses the de- 
vice. This implementation lets you control 
the final presentation of an object from a 
device gateway. 

The distributed class hierarchy that im- 
plements this type of inheritance allows 
you to create a device application that is 
based on services that are available through 
a supporting system such as a device gate- 
way. Take, for example, a home-heating 
and air-conditioning (HVAC) system. At its 
simplest, this is a system composed of a 
heating unit, chiller, and thermostat. If the 
devices were connected via different trans- 
ports to a home gateway, then the gate- 
way would provide services to the appli- 
cation running in the thermostat for 
network connectivity. The method invo- 
cations of the HVAC control to raise the 
temperature would result in invocations in 
the gateway-resident base class, which are 
then forwarded to the other subnetwork. 
This may be superior to having the ther- 
mostat application smart enough to deal 
with networking issues and encoding the 
data communications into the application. 


Connecting Objects 

Given a device network of objects, the 
next step is to create bindings between 
the objects. The approach I first thought 
to take was to create bindings between 
the properties, methods, and events of var- 
ious objects. As Figure 1 shows, there are 
a number of ways you can approach this. 


e Make a set of device objects and in- 
terfaces available from each device. 
Then make that set available to some 
configuration process. The configuration 
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process then determines what the over- 
all interaction should be and binds the 
methods and properties of the objects as 
appropriate. If the underlying commu- 
nications stack does not allow peer-to- 
peer communication, then the binding is 
accomplished through an intermediary 
node that acts as the bus master and 
spends a lot of time copying data around. 
Implement the interactions between de- 
vices adding remote object manipula- 
tion features to the device, then pro- 
gram the device applications to use 
remote device objects. For example, a 
thermostat could look up all available 
furnaces, bind to the appropriate one 
(magic happens here), then manipulate 
the furnace object directly. 

Implement a notion of object input for 
the needed objects, but let some exter- 
nal module control the remote object 
interaction and binding. Using this sys- 
tem, the device is not the only thing 
typed. The input to the device is from 
binding with a specific type of object. 
This is stronger than simply binding to 
a property of product because the type 
of the device supplying this input is en- 
forced. The idea here is to keep the ap- 
plication developer from having to deal 
with the complexity of network config- 
uration and management. Instead, the 
device firmware is created with a set of 
predefined interactions with other 
known devices. This is a scheme that I 
think is appropriate in a situation where 
there is a network coprocessor or a cen- 
tralized device gateway; see Listing One. 


In Figure 1(a), the user of the Hot Tub 
object knows that it is manipulating the 
Hot Tub, so the interaction is assumed ap- 
propriate. The developer of the Hot Tub 
enforces correct iteration with the equip- 
ment through the class definition, but trusts 
that users of this object will manipulate it 
appropriately. In Figure 1(b), the con- 
nection is between the implementor of an 
Occupant Status class, which describes the 
status of home occupants. The object state 
reflects that the occupant will not be 
around for the day. The desired actions 
resulting from this status are not the re- 
sponsibility of the home controller. Final- 
ly, in Figure 1(c), the Hot Tub object is 
created to use an Occupant Status object 
if it is connected. If so, the behavior is 
made to place the equipment in the ap- 
propriate mode, given the status infor- 
mation supplied. 

In all these scenarios, the binding phase 
presents a problem. The desire is to have 
a completely automatic plug-and-play type 
of system. The reality is that the binding 
of devices is often difficult or impossible 
to resolve automatically. Take, for exam- 
ple, a case where a consumer has two fur- 
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naces and two thermostats. How do these 
nodes know which to bind? The process 
of binding devices into control systems 
must be accomplished by: 


e Discovery of available devices. 

e Designation of suitable bindings. 
e Binding the devices. 

¢ Confirmation of correct binding. 


One or more of these stages can be au- 
tomated or skipped, but I have never seen 
a system where all these steps can always 
be skipped. One of the hurdles comes 
when the device subnetwork transport 
overlaps with subnetworks owned by oth- 
er consumers. It would be a real problem 
to bind a smoke detector with RF output 
from one house into another. 

With an object-oriented approach, de- 
vice networks can be built that are auto- 
matically configured and efficiently man- 
aged, but there is no magical property that 
can make what would essentially be a ran- 
dom guess into a wise choice. 


An Example 

The device network interface software that 
Listing One is written for lets each variable 
access to be implemented with an accessor 
function that controls the movement of data 
to and from the communications channel. 
This is why there is no apparent data move- 
ment from the outside world into this sys- 
tem. This example assumes that the class- 
es implementing Furnace are built from C. 
This is typical of control applications. 

If you look at the state transitions, the 
over-temperature fault, which happens 
when the filter gets too dirty, does not 
stop the furnace from proceeding to cool- 
down phase and eventual startup. This is 
indicative of how these types of devices 
must continue to operate as good as pos- 
sible even in the face of serious problems. 
That is why getting this valuable fault in- 
formation is so important to the consumer. 
A remote service tech can easily deter- 
mine the problem and even be alerted by 
the thermostat, which will know that its 
control is slipping. 

Any device that wishes to control the 
furnace must supply the HeatControl out- 
put. This is stronger typing than simply 
linking to any module that can control 
temperature. The devices or applications 
interested in fault and status may be a 
separate maintenance module, not just 
the thermostat. The meaning of the in- 
put/output functions are not ambiguous 
and leave little room for misunderstand- 
ing in terms of behavior resulting from 
communications. 


Extending the Contract 


Recall that I suggested devices might be 
implemented using inputs that are of cer- 
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tain class objects instead of using inputs 
from properties that are manipulated. This 
is an important choice because, on one 
hand, you get devices that cannot possi- 
bly be connected to inappropriate input. 
On the other hand, you risk losing flexi- 
bility and the ability to bind inputs to rea- 
sonable but unanticipated sources. 

Bidirectional contracts are good when 
the behaviors for given inputs are easy 
enough to design, but the inputs should 
be more strongly typed. If you view the 
inputs as classes, then this is done for you 
already because the contract is extended 
to include the appropriateness of the in- 
puts to the device. 

Device networking and object-oriented 
design is a natural fit because object- 
oriented design is premised on modeling 
real world systems and relationships. 
When embedded applications can be treat- 
ed as instances of standard device class- 
es, the devices can be integrated into larg- 
er, more automatic systems. To accomplish 
this automation, the device application de- 
velopers must not only create an instance 
of a device class, they must be able to 
structure the relationships of their appli- 
cation and other devices. Having this type 
of object-oriented approach to device net- 
working lets you vastly simplify the end- 
user’s environment by making devices 
work together without needing an end- 
user to be tied to the system. 


Conclusion 
As embedded microcontrollers find their 
way into more and more applications 
around us, we are becoming surrounded 
by systems that implement behaviors ap- 
propriate for their limited inputs. Many of 
these systems will have valuable infor- 
mation available to them that is never 
made available to the outside world. This 
is sometimes due to cost, location, or for 
the sake of simplicity for users. By adding 
networking capabilities, you can tie these 
isolated systems together into larger co- 
ordinated systems. The components of the 
larger systems may be created by differ- 
ent vendors, but should easily, if not au- 
tomatically, connect and work together. 
This requirement is familiar to software 
developers who have dealt with software 
interoperability issues for many years. The 
tool developed for this problem of inter- 
operability is object-oriented design. In 
device networking, an object-oriented ap- 
proach of encapsulating device capabili- 
ties into classes facilitates network con- 
figuration and provides strong typing 
required for ensuring the appropriateness 
of an interaction. However, the relation- 
ship created by merely encapsulating ca- 
pabilities, then using them elsewhere, does 
not sufficiently guarantee to the manu- 
facturer of a device that the device will 
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be used appropriately. I have suggested Applying object-oriented design to em- chanics of creating objects out of embed- 
that making input objects for the controlled bedded devices promises many of the ded applications must be addressed. 
systems will provide this guarantee and same benefits it delivers to larger software 

pave the way for automatic configuration. systems. To get these benefits, the me- DDJ 
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Listing One 

/* Heat Control Input. This input represents the heat on/off 
** control from a thermostat. 

** Emergency0ff 

** The emergency off input requires that any activity should 
** be terminated as soon as possible, but in a way that will 
** not damage equipment. In this system, the cool down cycle 
**k is shortened when this is present. 

** Heat 

** The heat input from the thermostat. The furnace must start 
#* heating as soon as possible and remain on until this is de-asserted. 


typedef struct { 
BOOL EmergencyOff; 
BOOL Heat; 
BOOL Economy ; 
} HeatControlIn; 
/* Furnace Status Output. 
** This is an output specific to Furnace device types. The 


** fault flag is used to indicate if there is a current fault to report. 


** FanOn - indicates whether the fan is currently running. 
** Heating - Indicates if the furnace is currently active. This 
2 is true during the phases of warm-up, heat, cool-down. 
** InternalTemp - The internal temperature. This value is used to 
** control the phase transitions as well as the 
i fault shut-down. 
* 
typedef struct { 
BOOL Fault; 
BOOL FanOn; 


BOOL Heating; 

TEMP InternalTemp; 

} FurnaceStatus; 
/* Fault Output. This is a generic equipment fault output. 
** The Reason field is an ascii text description of the fault. 
** The code is a device type specific fault value. The 
** tuple of (TypeId,code) will determine the fault meaning 
HY for other devices. 
* 
typedef struct { 

char *Reason; 

BYTE Code; 

DEVID Id; 

DevType Typeld; 

} FaultDesc; 


/* Sample State machine to go with these I/O on a furnace. */ 
/* check internal temperature, maintain status */ 


ee 


& develépiviéritor 
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status.InternalTemp = InternalTemp = ReadTemp() ; 


/* handle state transitions */ 
switch (FSTATE) 


case idle: 
if (! EmergencyOff) 


if (Heat) 
A: mea 


break; 
case starting: 
if (InternalTemp >= START_TEMP) 


SetFan (ON) ; 
ee = heating; 
break; 


case cooling: 
if ((EmergencyOff && (InternalTemp <= FAST_STOP_TEMP) ) 
i; (InternalTemp <= STOP_TEMP) 


{ 
SetFan (OFF) ; 
FSTATE = idle; 


} 
break; 
case heating: 
oY Heat) 


StopHeat () ; 
FSTATE = cooling; 
} 


ar ee etenr seep >= MAX_TEMP) 


StopHeat () ; 

FSTATE = cooling; 

status.Fault = TRUE; 

fault.Reason = "Internal temp exceeded, check filter"; 
fault.Code = FAULT_FURNACE_TEMP ; 

fault.Id = MyId; 

genre nyaets = Mytype; 


break; 
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icrosoft’s Active Servers Pages 
(ASP) and Sun’s Java Server Pages 
(JSP) are gaining popularity be- 
cause they unite client-side HTML, 
server-side scripting, and component- 
based development to produce dynamic 
pages. ASP supports a number of host 
scripting languages including Perl. The 
Perl module Apache::ASP provides an 
ASP port to the Apache Web Server with 
Perl as the host scripting language. So 
why would you want to develop your 
own ASP-like server-side scripting facil- 
ity if ASP is already available on Apache? 
The answer is simple— out of necessi- 
ty. In my case, I recently worked on a 
project to develop a performance re- 
porting add-on to enterprise network/ 
system management platforms including 
HP OpenView and Tivoli Netview. These 
products install their own web servers 
during the installation process. For ex- 
ample, HP OpenView installs an Apache 
Web Server when installed on HPUX and 
Solaris. (Either Microsoft Internet Infor- 
mation Server or Personal Web Server is 
used on Windows 2000 and NT.) How- 
ever, the Apache Web Server installed 
does not have the mod_perl module that 
is required by Apache::ASP to work. This 
posed a serious problem because the 
add-on product needed to work out-of- 
the-box. Customers shouldn’t be ex- 
pected to install a large number of oth- 
er Perl modules and devote considerable 
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time reconfiguring and extending their 
platform environment to make an add- 
on product work. The problem gets 
worse as the latest version of Netview no 
longer uses Apache as the default web 
server. Writing different versions of a 
web-based application for different web 
servers is not an option because of the 
development and maintenance effort. My 
solution, which I call “Perl Server Pages” 
(PSP), is to develop a small footprint Perl- 
based cross-platform JSP-like facility — 
with custom tag library support — for 
ed ee Pe cama he. A amweniaile  ageanain ds | 








generating dynamic pages. The only re- 
quirements are that your web server sup- 
ports CGI and you have Perl 5.005 or lat- 
er installed on your system. The complete 
source code and related files for both 
UNIX and Windows are available elec- 
tronically; see “Resource Center,” page 5. 

Note that there is also an open-source 
project that is also called Perl Server Pages 
(http://psp.sourceforge.net/). However, 
there is no connection between that pro- 
ject and the one I present here. 


PSP Features 


PSP is modeled after JSP. It is neither an 
ASP nor a JSP port. PSP includes many 
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JSP-like features and, most importantly, 
custom tag support. The latter gives you 
the ability to develop custom tag modules 
to encapsulate complex server-side be- 
haviors and business rules into simple 
XML-like elements that content develop- 
ers can use. 

PSP shares the same basic elements with 
JSP; see Table 1. However, PSP only sup- 
ports the directives in Table 2, although 
it does provide the built-in objects in Table 
3. PSP does not provide built-in session 
and application objects like JSP. The lim- 
itations of PSP are described in a later sec- 
tion. Listing One is a simple PSP page us- 
ing some basic PSP elements to display 
fonts with varying sizes. 

Like JSP, PSP lets you create custom 
tags. But unlike JSP, you are not required 
to provide an XML-based Tag Library De- 
scriptor (TLD) with each tag library you 
develop. 

Listing Two uses the custom tag 
<test::date>, which displays the current 
date and time according to the format 
specified in the format attribute. The 
<test::date> custom tag does not contain 
a tag body (an empty element) and it is 
terminated with a /. In general, a custom 
tag may or may not have a body. But it 
is always terminated either with a / or an 
explicit end tag. 

Listing Three is a PSP page that shows 
a more complicated example using nest- 
ed tags. It uses the custom tags <test:loop>, 
<test:if>, <test:condition>, <test:then>, and 
<test:else> to generate 10 random numbers, 
and displays either “head” or “tail” in an 
unnumbered list depending on whether 
the random number is greater than 0.5. 


Implementation 
PSP uses Perl’s object-oriented features 
exclusively. Like JSP pages, PSP pages 
need to be translated before use. 

The first thing that you need when 
implementing the PSP translator is a sim- 
ple parser that can handle HTML and 
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(continued from page 71) 
XML-like syntax. You don’t need some- 
thing complicated like HTML::Parser 
(http://search.cpan.org/doc/GAAS/ 
HTML-Parser-3.25/Parser.pm) that knows 
which tags can contain other tags, and 
which start tags have corresponding end 
tags. Nor do you need a parser that cre- 
ates a hierarchical tree of HTML content. 
All you need is a simple parser that rec- 
ognizes a small number of tags (mostly 
start and end tags) and parses the ele- 
ments into a list. HTML::SimpleParse 
(http://search.cpan.org/doc/KWILLIAMS/ 
HTML-SimpleParse-0.10/SimpleParse.pm) 
appears to satisfy these requirements. But 
on closer examination, it presents two ma- 
jor problems. First, it is not easily exten- 
sible to pick out elements not already rec- 
ognized by it. Second, it cannot handle 
tags with embedded > in them (</est:some- 
thing name="expr" value="x > y"/>, for 
example). These problems have been doc- 
umented by its author in the POD (Plain 
Old Document) for HTML::SimpleParse. 
I got around the first problem by chang- 
ing the module’s parse method to include 
a couple of new tag types that I wanted 
it to handle. I renamed the module 
PSP::SimpleParse. To get around the sec- 
ond problem, I created a subclass named 
PSP::Parser to override the text method 
that sets the content of the text to be 
parsed. Before it calls its parent’s text 
method, it replaces all single- and double- 
quoted strings with placeholders ~~NN~~, 
where NN is a unique number. The strings 
are saved in a hash. SimpleParse’s parse 
method is also overridden to reverse the 
transformation carried out by the text 
method. With these transformations, there 
is no need for SimpleParse to handle an 
embedded “>.” PSP expressions are an- 


other source for embedded “>” because 









Figure 1: PSP translator class diagram. 
they can appear anywhere in an HTML 
document. The solution here is to replace 
“<%=” with “&/t-%=”" and “%>” with 
““%&gt;”. The PSP translator takes care of 
the PSP expression code generation. 

Figure 1 is the class diagram for the PSP 
translator program that instantiates a 
PSP::Translator object to handle the trans- 
lation, a PSP::Trans Writer object to han- 
dle the translation output, and a PSP:: Writ- 
er object to handle error messages. Both 
PSP::TransWriter and PSP::Writer buffer 
output until the flush method is called. 

PSP::Translator’s. translate method in- 
vokes PSP::Parser’s file method to read in 
the PSP file, then its parse method to parse 
the text into a list. Translate then goes 
though each item in the list to pick out 
the six categories of tags that it needs to 
process: 


e TYPE_STARTENDTAG_S, which in- 
cludes most PSP elements — declara- 
tions, directives, and scriptlets. 

e TYPE_STARTENDTAG, which includes 
custom tags that don’t have a body as 
in <test:date />. The tag may contain 
PSP expressions that need to be pro- 
cessed. 


Table 1: PSP shares the same basic elements with JSP. 





Table 3: PSP supported builtin | objects. 
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e TYPE_STARTTAG, which includes all 
start tags: HTML and custom. Both kinds 
of tags may contain PSP expressions that 
need to be processed. The custom tags 
of this type all have a body. 

e TYPE_ENDTAG, which includes all end 
tags: HTML and custom. HTML start tags 
are passed straight through. Only cus- 
tomer tags are handled specially. 

e All other tags include text and other 
more exotic elements. Since PSP ex- 
pressions can appear in them, these tags 
are checked for the presence of PSP ex- 
pressions and handled appropriately, 
except for HTML comments. 


The translator relies on an external tem- 
plate file named “psp.tpl” (Listing Four) to 
format its output into a proper Perl script. 
All fields, except the keywords enclosed in 
~~ (~~BODY~~, for instance), are output 
as is. The keywords or placeholders are 
substituted by the content accumulated by 
the PSP::TransWriter. When a fatal error is 
encountered, PSP::Translator’s errorPage 
method is called to display an HTML error 
message before quitting. errorPage requires 
an external template file named “err- 
page.tpl.” Like psp.pl, it uses placeholders 
(keywords enclosed in “~~”) to mark where 
the error message should appear. These 
template files must be colocated in the same 
directory as the translators: pspt.pl and 
CGlIpspt.pl (described later). 

Translating the HTML and PSP basic el- 
ements into a Perl script is relatively straight- 
forward compared to providing custom tag 
support. Listing Five shows part of the trans- 
lated output of the PSP page in Listing Two. 
You may notice that there are a number of 
$out->print statements that print only blanks 
and new line characters in some translat- 
ed pages. It is simple to eliminate these 
statements in the translator. The only rea- 
son why this is not done is that the output 
will not be intended if <PRE> and </PRE> 
were used in a PSP. By eliminating blanks 
and new lines, preformatted text won't 
show up properly when displayed. 

The translated Perl script relies on the 
following classes or Perl modules to run: 

(continued on page 77) 
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e CGI, the built-in objects $request and 
$response point to the one and the same 
CGI object. I use CGI.pm for $request 
and $response because it is something 
most Perl developers are familiar with. 
By using CGI, they don’t have to learn 
yet another programming interface. For 
example, one uses CGI’s param method 
to retrieve CGI parameters, cookie method 
to create cookies, header method to set 
the HTTP header, and so on. 

PSP:: Writer, a buffered output class. 
Since the output is buffered, you can 
change the HTTP header at any time us- 
ing either the PSP include directive or ee >, 
$out->header and $response->header Extension tor Bluetooth™ 


methods to return an HTTP header. A , lp 
Pole TavareyZ-lalare Melantelacey Ae (Ue ie ve 
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tag handler class for handling the cus- 
tom tag <test:date> is test::date. 





A tag handler class or Perl module must 
be derived from PSP::TagHandler for each 
custom tag. The PSP::TagHandler class 
has the following methods: 


e new, a constructor. 

e doTagStart, a method you should over- 
ride to handle a custom tag. If the tag 
does not have a body, it should return 
a 0. If the tag has a body, you should 
put an initialization code here and re- 
turn a 1. In which case, the doBody 
method will be invoked next. 

e doBody, a method you should override 
to handle the manipulation of the tag 
body. You should return a 0 when you 
are done. Or you can return a 1 if you 
want to process the body again. 

e doTagEnd, a method you should over- 
ride to do any clean up after process- 
ing the custom tag. 

e vars, a method for you to create and 
share variables among related classes in 
nested custom tags. This method is par- : , i 
ticularly useful, if you intend to create a : | a chapter from 
complete markup language (like Cold- | “7 ~ JSP”, Serviets, and MySQL” 
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¢ errorPage, a method to create an HTML 
error page and terminate the executing 
Perl script. 


Your custom tag may contain one or 
more attribute=value pairs as in <test:date 
Sormat="%Y-%m-%d %a %H:%M:%S" />. 
In such a case, you need to provide a 
method called format, which is exactly 
the same as the attribute name, to handle 
the storage and retrieval of the value. 

Your custom tag handler can manipu- 
late the tag body by using the PSP::Body 
class. It has the following methods: 
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° new, a constructor. 

e request, which returns the in-built re- 
quest object. 

e response, which returns the in-built re- 
sponse object. 

e out, which returns the PSP:: Writer be- 
ing used by the current tag handler. 

e Body, which returns a string containing 
the entire tag body. 


The PSP::TagHandler and PSP::Body 
classes provide a simple interface to cre- 
ate custom tags. Figure 2 is the PSP en- 
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Listing Five shows part of the translat- 
ed PSP page for the simple PSP page that 
uses a custom tag to display the current 
date and time (Listing Two). Please note 
that all variables with prefix PSP_ are used 
internally by the PSP run time. You should 
not change them in any way. 

Each custom tag handler is contained 
in its own block, that is, between { and } 
and uses two PSP:: Writers, one for its en- 
closing tag and the other for the current 
tag. These writers are named $PSP_out 
and $out, respectively. The tag handling 
code then: 


1. Pushes these objects onto the stack 
($PSP_stack). 

2. Copies Sout to $PSP_out and creates a 
new $out PSP:: Writer object for han- 
dling the current tag’s output. 

3. Stores the custom tag in the variable 
$PSP_tag. 

4, Creates a new tag object by using the 
built-in PSP::TagObjFactory object’s Cre- 
ateTagObj method. The tag handler 
class name is derived from the tag 
name. It simply inserts an extra : in the 
tag name. For example, the tag handler 
class is test::date for the custom tag 
test:date. 

5. Calls the newly created tag object’s 
doTagStart method. 
Skips Step 7 if it returns a 0. 
Keeps calling the tag object’s doBody 
method until it returns 0. 
Calls the tag object’s doTagEnd method. 
Calls the built-in PSP::TagObjFactory 
object’s houseKeeping method to up- 
date the ancestry so that nested tag han- 
dlers can find their ancestors by calling 
the inherited findAncestorWithClass 
method. 

10.Restores the old values of $PSP_out 
and Sout by popping them from the 
PSP_stack. 


A more complicated piece of code re- 
sults when nested tags are involved. But 
it always follows the same pattern; that is, 
another tag handling block is inserted 
within the doBody loop of the enclosing 
tag handling code. 


Examples 
Listing Six contains the tag handler class 
for the <test:date> custom tag used in List- 
ing Two. It overrides the doTagStart 
method of its superclass PSP::TagHandler. 
Since it supports the format attribute, it 
also implements the format method. When 
called with a value, it sets the class data 
member /format'. When called without a 
value, it returns the stored /format' value. 
This is the simplest tag handler you can 
build because it does not have a tag body. 
Look at the tag handler classes used 
in the PSP page in Listing Three. Listing 
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(continued from page 98) 

Seven contains the fest::loop class. This 
time, test::loop overrides both its super- 
classes doTagStart and doBody methods. 
In doBody, it outputs its tag body and re- 
turns a 1 to repeat the operation for the 
number of times specified in the ‘repts’ 
data member. Also note that it implements 
the tag attribute handling method repts. 

Listings Eight, Nine, and Ten contain 
the test::if, test::condition, and test::then 
classes, respectively. The test::ifis the sim- 
plest class possible. It is an empty class 
that uses the PSP::TagHandler class with- 
out modification. 

Test::condition overrides the doTagStart 
method to use the findAncestorWithClass 
method to check if it is enclosed inside 
the <test::if> tag. It calls errorPage to gen- 
erate an HTML error message and termi- 
nates if an enclosing <test:if> tag is not 
found. DoBody retrieves the body; that is, 
the result in the evaluation of the speci- 
fied condition, and saves it in the enclos- 
ing test::if object’s hash for use by fest::then 
and test::else. 

Test::then’s doTagStart is similar to that 
of test::condition. Its doBody method 
checks if a condition has been stored in 
the enclosing ¢est::if’s hash. If yes, it re- 
trieves it and outputs the entire tag body, 
provided that the retrieved condition is 1. 
If there is no condition stored in the en- 
closing test:.:if object’s hash, it generates 
an HTML error message and terminates. 
The test::else class is almost the same as 
the fest::then class. Hence, it is not listed. 


Deployment 
There are two ways to deploy PSP 
pages— manual translation and automat- 
ic translation. 

You can manually translate all PSP 
pages using pspt.pl into Perl scripts and 
work exclusively with the translated PSP 
pages. All your HTML hyperlinks (bref 
and dction properties in forms) should 
point to the translated .pl files. The ad- 
vantage of this approach is that you are 
working with Perl scripts exclusively and 
do not have to worry about different web 
servers using different mechanisms to run 
a PSP page (discussed next). 

In the automatic translation approach, 
you configure the web server to use the 
CGIpspt.pl script to run your PSP pages, 
that is, pages with the .psp extension. If 
you are using Microsoft web servers (PWS 
or IIS), you do this by adding a new string 
value to the Windows registry: 


\HKEY_LOCAL_MACHINE\SYSTEM\ 
CurrentControlSet\Services\w3svc\ 
parameter\Script Map\.psp 


and give it a value: “C:\Perl\bin\Perl.exe 


C:\cgi-bin\CGIpspt.pl %s %s”. If the phys- 
ical path C:\cgi-bin has been mapped to 
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the virtual directory /cgi, after restarting 
the web server, you can reference your 
PSP page as: 


http://hostname/cgi/whatever.psp 


Unfortunately, the mapping mechanism 
is quite different. on Apache (even for 
Apache running on Windows platforms). 
You have to add the following entries in 
Apache’s srm.conf file: 


AddHandler psp-script .psp 
Action psp-script "C:/cgi-bin/CGIpspt.pl" 


After restarting the web server, you can 
reference your PSP page as: 


http://hostname/cgi/CGIpspt.pl/cgi/ 
whatever.psp 


The CGIpspt.pl script uses the environ- 
ment variable PATH_TRANSLATED to get 
the translated path name of the specified 
PSP page. In both cases, PATH_TRAN- 
SLATED contains C:/cgi-bin/whatever.psp. 
This means that HTML and PSP pages that 
contain links need to have the links mod- 
ified when moved from Apache to IIS or 
vice versa. This is a bit of a pain, but the 
problem can be resolved by writing a sim- 
ple Perl script to automate the process. 

Another potential problem is related to 
the placement of Perl and PSP modules 
(.pm files). You may need to change the 
“#!” and “use lib” statements in CGIpspt.pl, 
pspt.pl, and psp.tpl for your environment 
to make PSP work properly. Check the 
web-server log to get more information if 
PSP does not work. 

CGlIpspt.pl checks for the existence of 
the specified .psp file and displays an er- 
ror message if it is not found. It then 
checks if a translated version, that is, a file 
by the same name but with file extension 
.pl, is already present in the same direc- 
tory. If it is not present, CGIpspt.pl in- 
vokes a PSP::Translator object to create 
it. If it is present, CGIpspt.pl interprets it 
by using the eval function. Since eval uses 
the current environment to interpret the 
translated Perl script fed to it, all CGI pa- 
rameters are available to the translated 
script. If you should change a PSP page, 
you need to remove its old translated .pl 
file before the changes will be picked up 
during subsequent invocations. During 
the development cycle, you may want to 
comment out the part in CGlIpspt.pl 
where it checks for the presence of the 
.pl file so that it always translates the 
most current version. 

For better control of the PSP installa- 
tion process (no flames please), I depart 
from the CPAN way of installation using 
make. I’ve created two packages, one for 
Windows and the other for UNIX. Both 
packages are also available electronical- 
ly. The Windows package uses the usu- 
al Windows setup.exe to install. It sets up 
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the Windows registry for you automati- 
cally. This makes deployment easy if you 
are using Microsoft web servers. The 
UNIX version uses a shell script for in- 
stallation. 


Limitations 

PSP unites client-side HTML and server- 
side scripting in a small package that 
consists of eight small Perl modules. Most 
of them are around one page in size. 
The small package is packed with fea- 
tures like custom tag support. Because 
of its small size and independence of 
web-server features, PSP pages can be 
moved easily among different platforms. 
However, portability does not come 
without cost. 

Owing to its use of the CGI interface, 
PSP suffers from the same problems that 
plagued normal Perl CGI scripts (not 
counting using perl_mod with Apache): 


e Each Perl script invocation spawns a 
new process. This may limit the scala- 
bility of the web site. 

¢ No built-in session or state support. You 
have to rely on the usual mechanisms 
such as hidden fields, cookies, and URL 
rewriting to maintain state information. 


Other limitations, if you are deploying 
PSP using the automatic translation ap- 
proach, include: 


¢ CGlpspt.pl uses perl’s eval function. This 
feature is considered unsafe by some, 
security wise. 

e You need to give write access to your 
CGI directory or whatever directory you 
put the PSP translator and PSP pages in 
because the translator generates Perl 
scripts. This again may be considered a 
security risk. 

e When deploying web applications con- 
sisting of HTML or PSP pages on dif- 
ferent web servers, hyperlinks (bref and 
action properties in forms) pointing to 
these pages may need to be changed 
due to different web servers using dif- 
ferent mapping mechanisms. However, 
this can be resolved by using another 
Perl script to fix up the links. 

e The translator and run-time error mes- 
sages may not provide sufficient details 
to pinpoint the problem. 

e The translator is not a validating parser 
for HTML or Perl. It does not detect any 
syntax problems related to HTML (such 
as missing end tags) or Perl. 


Conclusion 

PSP provides a portable environment for 
developing dynamic web pages using fea- 
tures that are familiar to ASP and JSP de- 
velopers such as declarations, directives, 
expressions, and scriptlets. It provides a 
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simple, yet powerful custom tag support 
mechanism to encapsulate complex server- 
side behaviors and business rules into sim- 
ple XML-like elements that content de- 
velopers can use. The custom tag 
programming interface is similar to that 
of JSP, but does not require an XML-based 


tag library descriptor. These features, when 
paired with a wide array of Perl modules 
available on the Internet, make PSP a sim- 
ple, yet powerful tool to use if the appli- 
cations developed need to be deployed 
on different platforms running different 
web servers. The only requirements are 
that these web servers support the CGI 


interface and you must have Perl 5.005 or 
later installed on these systems. The PSP 
development is by no means complete. I 
will post new versions and extensions at 
http://www.playsport.com/psp_home/ as 
they are available. 


DDJ 





isti <TITLE>Nested Custom Tag Demo</TITLE> 
Listing One <r? 
<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN"> 
<!-- Simple changing font size demo --> <BODY> 
<HTML> <HEAD> <H3>PSP Nested Custom Tag Demo</H3><P> 
<TITLE>Simple Changing Font Size Demo</TITLE> <UL> 
</HEAD> 
<BODY> <test:loop repts=10> 
<H3>PSP Simple Changing Font Size Demo</H3><P> <LI> 
For loop incrementing font size from 1 to 5: <P> <test:if> 


<%-- You should not see this--%> 
<% for(1..5) { %> 
<!-- iterated html text --> 


<FONT SIZE="<%= $_ %>" > Size = <%= $_ %> </FONT> <BR> 


<% } %> 
</BODY> 
</HTML> 


Listing Two 


<test:condition><%= (rand > .5) %></test:condition> 
<test: then>Head</test:then> 
<test:else>Tail</test:else> 


</test:if> 
</test:loop> 


</UL> 


</BODY> 
</HTML> 


<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN"> 


<HTML> <HEAD> 

<TITLE>Simple Custom Tag</TITLE> 
</HEAD> 

<BODY> 

<H3>PSP Simple Custom Tag Demo</H3><P> 


The curent date and time: <test:date format="%Y-%m-%d %a %H:%M:%S" /> 


</BODY> 
</HTML> 


Listing Three 


<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN"> 


<HTML> 
<HEAD> 
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Listing Four 
#!perl 


#eoeoo dco aaidcai ioc iicaiiciaiick ick ak kak kaka ick kk kak ka ak kok ak kok kak kak 
# This script is generated by the PSP translator based on a PSP file. 


# import section 

use PSP::Writer; 

use PSP: :TagObjFactory; 
use CGI; 
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# custom tag modules import if any 
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(continued from page 82) 


# declaration section 
~~INIT~~ 


# PSP initialization 

my Sout = new PSP::Writer; 

my $request = new CGI; 

my $response = $request; 

my $psp = new PSP::TagObjFactory($request, $response, "errPage.tp1") ; 
my @PSP_stack; 

my $PSP_out = Sout; 


# header section 

~~HEADER~~ 

# PSP body 

~~BODY~~ 

# send buffered output to STDOUT 
Sout->flush(\*STDOUT) ; 


e e e 

Listing Five 
{ 
push @PSP_stack, Sout; 
push @PSP_stack, $PSP_out; 
SPSP_out = Sout; 
Sout = new PSP::Writer; 
my $PSP_tag; 
SPSP_tag = qq(<test:date format="%Y-%m-%d %a %H:%M:%S" />); 
my $tagobj_® = $psp->createTagObj($PSP_out, Sout, $PSP_tag); 
if ($tagobj_0->doTagStart() == 1) { 

my $pred_@ = 1; 

while (S$pred_@) { 

Spred_® = $tagobj_@->doBody() ; 
} 


} 

$tagobj_@->doTagEnd () ; 
Spsp->houseKeeping ($PSP_out, Sout); 
SPSP_out = pop @PSP_stack; 

Sout = pop @PSP_stack; 

} 


Listing Six 
package test::date; 


use POSIX; 
use PSP::TagHandler; 
@ISA = qw(PSP::TagHandler) ; 


# constructor 
sub new { 
my $class = shift; 
my $self = S$class->SUPER: :new(@_) ; 
Sself->{'format'} = '%c'; 
return $self; 
} 
# handler for processing the start of a custom tag 
sub doTagStart { 
my $self = shift; 
my Sout = $self->{'body'}->out(); 
my ($sec,$min,Shour,$mday,$mon,$year,$wday,Syday,$isdst) = localtime(); 
Sout->print (strftime ($self- 
>{'format'},$sec,$min,$hour, $mday, $mon, $year, Swday ,$yday,Sisdst)) ; 
return @; 
} 


# handler for the format attribute 

sub format { 
my $self = shift; 
Sself->{'format'} = shift if @_; 
return $self->{'format'}; 

} 

Is 


Listing Seven 
package test::loop; 


use PSP: :TagHandler; 
@ISA = qw(PSP::TagHandler) ; 


# constructor 
sub new { 
my S$class = shift; 
my $self = $class->SUPER: :new(@_) ; 
Sself->{'repts'} = undef; 
return $self; 
} 
# handler for processing the body of a custom tag 
sub doBody { 
my $self = shift; 
my Sbody = $self->{'body'}; 
my Sout = S$body->out; 
# output the tag body only for the specified number 
# of times 
if ($self->{'repts'}--) { 
Sout->print ($body->body) ; 


return 3; 
} 
return @; 
3 
# handler for the repts attribute 
sub repts { 
my $self = shift; © 
Sself->{'repts'} = shift if @; 
return S$self->{'repts'}; 
} 
is 
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Listing Eight 
package test: :if; 


use PSP::TagHandler; 
@ISA = qw(PSP::TagHandler) ; 
1’ 


e e e 
Listing Nine 
package test::condition; 


use PSP: :TagHandler; 
@ISA = qw(PSP::TagHandler) ; 


# constructor 
sub new { 
my $class = shift; 
my $self = $class->SUPER: :new(@_); 
return Sself; 
} 
# handler for processing the start of a custom tag 
sub doTagStart { 
my $self = shift; 
#check if enclosed in test::if 
my $parent = $self->findAncestorWithClass("test::if"); 
if (defined(Sparent)) { 
# save the parent object 
Sself->{'parent'} = Sparent; 
return 1; 
} 
$self->errorPage("errPage.tpl", 
"S&lt;test:condition&gt ; 
not enclosed in &lt;test:if&gt;&lt;test:if&gt;"); 
return Q; 
} 
# handler for processing the body of a custom tag 
sub doBody { 
my $self = shift; 
my $body = $self->{'body'}; 
my Sout = S$body->out; 
my $text = Sbody->body; 
my Sparent = $self->{'parent'}; 


# create condition variable in parent object 
Sparent->vars('condition', sprintf("%d", S$text)); 


return Q; 
} 
1; 
Listing Ten 


package test: :then; 


use PSP: :TagHandler; 
@ISA = qw(PSP::TagHandler) ; 


# constructor 
sub new { 
my S$class = shift; 
my $self = $class->SUPER: :new(@_) ; 
return $self; 
} 
# handler for processing the start of a custom tag 
sub doTagStart { 
my $self = shift; 


#check if enclosed in test::if 
my Sparent = $self->findAncestorWithClass("test::if") ; 
if (defined(Sparent)) { 
# save the parent object 
Sself->{'parent'} = S$parent; 
return 1; 
} 
$self->errorPage("errPage.tpl", 
"&lt;then:condition&gt ; 
not enclosed in &lt;test:if&gt;&lt;test:if&gt;") ; 
return @; 
} 
# handler for processing the body of a custom tag 
sub doBody { 
my $self = shift; 
my Sbody = $self->{'body'}; 
my Sout = Sbody->out; 
my $text = $body->body; 
my Sparent = S$self->{'parent'}; 


# check if a condition has been set in the parent object 
my Scond = $parent->vars('condition') ; 
if (!defined(Scond)) { 

S$self->errorPage("errPage.tpl", 

"&lt;test:then&gt; without &lt;test:condition&gt;") ; 

} 
# output the tag body only if the condition is true 
if ( $cond == 1) { 

Sout->print ($text) ; 
} 


return @; 
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dobe’s Portable Document Format 
(PDF) has become a standard format 
for creating and viewing documents 
that need to retain special formatting 
beyond what you can produce using 
HTML. The Adobe PDF Reader is built into 
Internet Explorer and Netscape Navigator, 
and is probably the most common plug- 
in downloaded for older browsers. As a 
result of the growing popularity and ac- 
ceptance of PDF documents, the develop- 
er community has turned to them to alle- 
viate some of the inherent printing and 
formatting weaknesses of HTML. 
Organizations most frequently use PDF 
documents to deliver blank forms to users. 
For instance, the IRS uses PDFs to dis- 
tribute tax forms to taxpayers who then 
download them, fill them out, and mail 
them in. This saves taxpayers the time and 
hassle of going to the Post Office or li- 
brary to find forms, and saves the IRS the 
postage and manpower of mailing them 
out. But that only scratches the surface of 
what you can do with PDFs on web sites. 
In this article, ’ll show you how to use 
Adobe’s Forms Data Format (FDF) Tool- 
kit to make PDF documents come alive. 
When used in conjunction with other 
tools, you can use PDF documents in place 
of HTML forms, interact with databases, 
and deliver highly formatted, completed 
documents, even if the documents use 
complex logic to derive the contents of 
the fields. 


The Adobe Forms 

Data Format (FDF) Toolkit 

The FDF Toolkit (http://partners.adobe 
.com/asn/developer/acrosdk/forms.html) 


Lauren is a partner at Spin Solutions, 


LLC. She can be reached at lhightower@ 
spinsolutions.com. 


http://www.ddj.com 


Examining the 
FDF Toolkit 


Making PDF files come alive 


Lauren Hightower 


targets developers who want not only to 
deliver blank forms to users, but also want 
to work programmatically with the data 
that populates the form. You can use the 


‘toolkit with the full version of Adobe Ac- 


robat to design smart forms that have func- 
tionality beyond static, blank PDF docu- 
ments. The key to unlocking the power 
of the toolkit lies in understanding Acro- 
bat Forms. 

Acrobat Forms are extensions to the PDF 
format that overlay a PDF document and 
let you add form elements like buttons, 
text fields, radio buttons, drop-down list- 
boxes, and the like. The elements you can 
add to a form are the same as those you 
can have on an HTML page. To add these 
elements, you use the Form tool in the full 
version of Adobe Acrobat. Once you add 
the fields using Acrobat, the FDF Toolkit 
lets you populate them programmatically 
before displaying them on the PDF. 

The FDF Toolkit is a threadsafe API you 
can run on any web server under Win- 
dows NT, Windows 2000, any flavor of 
UNIX, or Linux. You can use the toolkit 
with C/C++, Java, Perl, Visual Basic, and 
Active Server Pages (ASP). For this article, 
I'll use Windows NT 4.0, Visual Basic, and 
Active Server Pages. 


An FDF Example 

To illustrate how to use a PDF document 
with the FDF Toolkit, P’ll use the IRS Form 
4868, Application for Automatic Extension 
of Time to File U.S. Individual Tax Return. 
As Figure 1 shows, this form has five short 
sections and two calculated fields. 

The FDF Toolkit makes it possible for 
users to enter information directly into the 
PDF document, eliminating the need to 
create an HTML data-entry form. Howev- 
er, I prefer to assume the added work of 
creating an HTML form because I can do 
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more validation and error checking using 
JavaScript in a familiar HTML development 
environment. The Adobe-specific Acrobat 
Forms JavaScript Object Model, which 
comes integrated into the full version of 
Acrobat, is clunky and difficult to use. Be- 
cause users are so accustomed to PDF doc- 
uments being static rather than interactive, 
it is also an unfamiliar environment for users 
entering the data who might be stumped 
when they see an empty PDF form with 
no instruction on how to enter data. 

You can see the finished example in 
action at http://www.spinsolutions.com/ 
pdfArticle/. 


Creating HTML Forms 

For this example, I'll assume the user is a 
CPA who is filing an extension for a client. 
The basic client information is stored in a 
database on the CPA firm’s intranet. Us- 
ing a component written for another ap- 
plication, I can call a function GetClient 
to retrieve the Identification information 
required in Part I of Form 4868. 

The code that generates the HTML form 
in the extension.asp file is available elec- 
tronically (see “Resource Center,” page 5). 
The Visual Basic source code for retriev- 
ing the data from a relational database is 
also included. Because the focus of this 
article is not on COM objects or ASP, I 
won't delve into the details of either of 
those code snippets, but will assume you 
are familiar with a method of creating an 
ASP page with a form and populating the 
form fields with data. 


Marking Up the PDF Document 

You can retrieve a copy of IRS Form 4868 
from the IRS at http://www.irs.gov/. To 
mark up a document, you need the full 
version of the Adobe Acrobat software, 
Version 4.0 or above. To draw the fields 
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(continued from page 87) 

on the PDF document, you use the Form 
tool. For step-by-step instructions on 
drawing fields on a PDF, see the docu- 
mentation that comes with Adobe Acro- 
bat. Immediately after you finish drawing 
each form field, the Field Properties dia- 
log appears so you can define the field. 

For simplicity’s sake, I assign the same 
name to each PDF field that I assigned to 
the corresponding HTML field. For in- 
stance, the TaxPayerName field on the 
HTML form corresponds with the 7ax- 
PayerName field on the PDF. As you'll 
see, this lets me cycle through the results 
of the form and set values on the PDF 
without mapping specific form names on 
the PDF to specific form names on the 
HTML form. 

Acrobat gives you a lot of flexibility in 
defining the way a field looks and acts. I 
focus on the cosmetic aspects of the Prop- 
erties dialog because I do most of the pro- 
cessing of data using other tools and let 
the PDF simply display the results. Using 
the Appearance tab, I set each of the fields 
as Text fields and mark them as Read 
Only, prohibiting users from changing the 
information once the PDF is generated. 
The Format tab lets me set all of the num- 
ber fields in sections III, IV, and V on the 
form to Number fields that display with 
two decimal places and a dollar sign ($) 
preceding the number. 

Although I do most of the processing 
of data in the ASP page, I let the PDF doc- 
ument handle one of the calculations for 
me. The Total Liability field in Section V 
of the form is a simple calculation that 
sums lines 6, 7, and 8. To set this field to 
display the sum of those fields, I choose 
the Calculate tab, then select the Pick but- 
ton and add all three fields. 

Once the PDF is marked up, save and 
close it. The next step is getting the data 
from the HTML form and displaying it on 
the PDF document. You can download a 
marked up copy of the IRS form from 
http://www.spinsolutions.com/pdfArticle/ 
£4868. pdf. 


Using the FDF Toolkit 

Once you download the toolkit from http:// 
partners.adobe.com/asn/developer/acrosdk/ 
forms.html, you must install it. The in- 
stallation for each version of the toolkit is 
different, but to install the Windows ver- 
sion, simply double-click the download- 
ed executable and follow the directions. 
The download contains two DLLs: Fd- 
facX.dll and FdfTk.dll. You must install 
and register FdfacX.dll into a directory that 
has execute permissions (Adobe recom- 
mends \winnt\system32\inetsrv\ASP \ 
cmpnts). FdfTk.dll does not have to be 
registered, but it must be in the same 
directory as FdfacX.dll or in the system 
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directory (\winnt\system32). If you are 
running NTFS, both DLLs must have ex- 
ecute permissions. 

The FDF Toolkit can be used with a 
number of languages. For this example, I 
use it with VBScript in an Active Server 
Page. I instantiate the component the way 
I do any other component. Once instan- 
tiated, it has a series of methods and prop- 
erties that I can call. 

Listing One displays the ASP code that 
instantiates the object, then calls the func- 
tion FDFCreate, which returns an object 
of type FDFACXLib.FdfDoc with 36 meth- 
ods. One of the methods associated with 
the FDF object is FDFSetValue. I use this 
method to set the values for each field on 
the PDF document. The first code segment 
in Listing Two cycles through the fields 
on the HTML form and sets the value of 
the corresponding field on the PDF doc- 
ument. The second code segment calcu- 
lates the Balance on line 6 of Part II, then 
sets the value on the corresponding field. 

Listing Three shows the final step in 
creating the FDF— defining which PDF 
document it goes with and saving the file. 
(I’ve hard coded the paths for both files 
to make the example easy to read, but 
you should use the registry or an ini file 
to hold the values of a working directory 
so the application is easy to move from 
server to server.) 

Once I create and save the file, I can 
look at the generated FDF in a text editor 
to see what I generated. Listing Four shows 
the contents of the generated FDF file. 

The FDF Toolkit documentation de- 
scribes the format of the file in detail. In 
reality, I don’t need the FDF Toolkit to 
generate an FDF file, but it does make it 
easier. Because it is a simple text file, I 
could also use the FileSystem object to 
create and save an FDF file in the specif- 
ic FDF format. 


Displaying the Results 

To display the completed Form 4868, I 
redirect the user’s browser to the gener- 
ated FDF file using Listing Five. 

The big drawback of generating and dis- 
playing FDF files is that individual users 
must specifically set up their browser to dis- 
play FDF files in the Adobe Acrobat Read- 
er. To do this is not simple and could be a 
daunting experience for some inexperienced 
users. For instance, to accomplish this in 
Internet Explorer 5.0, users would: 


1. Open Windows Explorer, choose Tools... 
Folder Options...File Types. 

2. Scroll to FDF in the list of file types. (If 
FDF is not in the list, click the New but- 
ton and add it to the list.) 

3. Select the Change button, then select 
the Portable Document Format and 
click OK. Uf PDF is not an option, 
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(continued from page 90) nately, Adobe doesn’t offer a way to dy- display the completed PDF through a web 
users need to download and install the namically merge FDF and PDF files, al- browser like any PDF document. FDF- 
Adobe Acrobat Reader from http:// lowing you to redirect the user toa com- Merge is easy to install, but a little more 
www.adobe.com/.) pleted PDF document instead of an FDF _ difficult to use. 

4. Click Close. file. The best bet is to invest in FDFMerge, Because FDFMerge is a synchronous 

a tool from Digital Applications — utility, you must execute the application 
The instructions for associating FDF files — (http://www.digapp.com/). on a command line, then check for the 
with the Adobe Acrobat Reader are dif- FDFMerge is a command-line utility that | completion of the process before you can 
ferent for different browsers and different merges an FDF file and a marked up PDF, ___ display the PDF file to the user. Listing Six 
versions of the browsers. If your target au- _ then uses flattening technology to display shows how to do this in Visual Basic. 
dience is the general population, this is it as a completed PDF document. Once The ExecCmd function is included in 
probably not the best option. Unfortu- you merge the FDF and the PDF, you can the downloadable code. The ExecCmd 
function starts the utility and stops pro- 

— — =e Se se — wae _| cessing of the calling function until the 

oa fcc is _.. 7 = oS Co ea and . ith ; hat starts th He din ge ul an 2 utility has completed its task. By breaking 
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(continued from page 92) 

directly to the screen instead of redirect- 
ing users to a generated file. The benefit 
of this approach is that you can delete the 
file after generating it and it won't take up 
space on the server. It is also a more se- 
cure method of delivering sensitive data 
since a generated file is not stored on the 
server. 

A PDF document is a Postscript binary 
file. To write the file directly to the brows- 
er, the page displaying the document must 
have a content type of “application/pdf.” 
This automatically tells the browser to use 
the plug-in associated with content type 
“application/pdf,” the Adobe Acrobat 
Reader. 


Listing Seven shows the function Gen- 
erate4868, which opens and reads the 
generated PDF file, returns the contents 
of the PDF file in binary format to the ASP 
page, then deletes the generated PDF. 
Once the object returns the binary con- 
tents of the PDF file, the ASP page can 
display the PDF file in the Acrobat Read- 
er by writing the binary contents to the 
page with Listing Eight. 


Conclusion 

As the Web evolves into a true applica- 
tion platform, the necessity to gather, ma- 
nipulate, and display data in highly for- 
matted forms is outpacing the tools 
provided by Adobe. Until Adobe can of- 


fer a complete forms solution, using a 
combination of tools is the best strategy 
for delivering interactive forms. 

The IRS has implemented its own ver- 
sion of the fill-in 48608 form (http://ftp 
.fedworld.gov/pub/irs- fill/f4868. pdf). 
This version allows the taxpayer to fill 
in the fields directly on the PDF, but pro- 
vides no validation and no automatic 
calculation. This is fine for a simple 
form, but users are starting to expect 
smarter forms that are prepopulated with 
data and have automatic calculations and 
validation. 
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Listing One 


<% 
Set objFDF = Server.CreateObject ("FdfApp.FdfApp") 
Set FDF = objFDF.FDFCreate 

%> 


Listing Two 


<% 
For Each vpItem In Request.Form 


FDF.FDFSetValue vpItem, UCase(Request.Form(vpItem)), 


Next 


fpBalance = Request.Form("Liability") - Request.Form("Payments") 


FDF.FDFSetValue "Balance",fpBalance,False 
%> 


Listing Three 


<% 


FDF.FDFSetFile "c:\prj\pdfarticle\f£4868. pdf" 
FDF.FDFSaveToFile "c:\prj\pdfarticle\temp.fdf" 


%> 
Listing Four 
%EDF-1.2 


False 


»> 
endobj 


2 ® obj 


/EDF << /Fields 2 @ R /F (c:\\prj\\pdfarticle\\£4868. pdf) >> 
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The problem: Compiler runtime libraries allow only one thread at a time to be active in the heap. So on SMP systems, when 

multiple threads make concurrent heap requests, all but one will be blocked by the heap manager, nullifying the benefit of the 
extra CPUs. Worse yet, each time a thread is blocked, the OS invokes a context switch. The result: adding processors results in 
a vicious cycle of context switching that can prevent your app from scaling. 


The solution: SmartHeap™ for SMP. For NT, Solaris, HP-UX, AIX, DEC OSF, SGI IRIX. Partial user list: Netscape, HP, 
Lexis-Nexis, Lucent, NASDAQ, Computer Associates, Ericsson, Spyglass, Allaire, NetGravity, Software.com, Simware, RightPoint, 
OptiMark, Fidelity, Bankers Trust, Pinnacle, UPS, OnDisplay, i2 Technologies, New Era of Networks, Rockwell, eDocs. 
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oes ATT TEA A A TT TS TL TTT A TE EL LR RT RT RENE RET TTA TTR OL AE ET 


tis / 
i /T (TaxPayerName)/V (JANE MARIE DOE) >> Jon Dye Dyeje)e S 


<< /T (TaxPayerAddress)/V (123 MAIN STREET) >> 


( 
<< /T (TaxPayerCityStateZip)/V (ANYTOWN, CA 99097)>> al 7 
peas late Netwo 
<< /T (SSN_2)/V (11)>> wm 
( 
( 
( 


<< /T (SpouseGiftGSTExtension)/V ()>> 
<< /T (SSN_3)/V (1111) >> 

<< /T (SpouseSSN_1)/V (999)>> 

<< /T (SpouseSSN_2)/V (99)>> 

<< /T (SpouseSSN_3)/V (9999) >> 

<< /T (Payments)/V (2000) >> 

<< /T (GiftGSTExtension)/V (X)>> 
<< /T (Payment)/V (5800) >> 

<< /T (Liability)/V (5000) >> 

<< /T (SpouseGSTPayment)/V (89) >> 
<< /T (GSTPayment)/V (3000) >> 

<< /T (Balance)/V (3000)>> 


from 
dava Shared Data | January 1988 to Dec 1999 on one ff 


endobj ee CD-ROM 
trailer | pessoas = 
Isat 1OR | : r “ sa aa now! 
I soe 
Listing Five 
g Response.Write "<a href=temp.fdf>Click Here to see the PDF</a>" Sign up ice) § this special 
%> 
rogram and officially link your 
Listing Six prog y y 


spCommand = c:\fdfmerge\fdfmerge.exe -1 Log.txt -o web site 10) WTANTANAe lel Mnelan 
..and earn money! 


e:\prj\pdfarticle\temppdf. pdf 

c:\prj\pdfarticle\f£4868. pdf 

c:\prj\pdfarticle\temp. fdf 
vpShell = ExecCmd(spCommand) 


Listing Seven Go to www.ddj.com/affiliates/ 


Public Function Generate4868 (ByVal vhPDFFileName As Variant, 


Bie copes ag! eoicg ByVal vhFDFFileName As Variant) As Variant and complete an affiliate application. 
Dim vpShell As Variant 


Dim spCommand As String lf approved, Dr. Dobb’s will send you 


Dim opFileSys As Object 


‘encase aI ea eee official icons or text links to be placed 


Set opFileSys = CreateObject ("Scripting.FileSystem0bject") on your web site. Earn $5 for each 
spPDFTemp = opFileSys.GetTempName () 
spPDFTemp = Replace(spPDFTemp, ".tmp", ".pdf") Mt ' 5 or 
spPDFTemp = "c:\prj\pdfArticle\" & spPDFTemp magazine subscription that originates 
' GENERATE THE PDF FILE USING FDFMERGE aged an your site. 
spCommand = "c:\fdfmerge\fdfmerge.exe -1 Log.txt -o " & spPDFTemp & " " & 
vhPDFFileName & " " & vhFDFFileName 


vpShell = ExecCmd(spCommand) ' Run FDFMerge. 
' READ THE BINARY FILE AND SEND IT 
' TO THE USER 
If vpShell = @ Then 
Generate4868 = readBinFile(spPDFTemp) 
opFileSys.DeleteFile vhFDFFileName, True 
opFileSys.DeleteFile spPDFTemp, True 
Else 


Generate4868 = "There was an error in the generation." eiake 
Bnd CAA coin 


Don't let this exciting 
opportunity pass you by. 
Become a DDJ affiliate today! 


Set opFileSys = Nothing 
End Function 


e e e 

Listing Eight 

<% 

Set oPDFGenerator = Server.CreateObject("pdfArticle.clsInterface") —_—_—_—— 

vPDFContent = oPDFGenerator.ReadPDF (TempPDF) y SOFTWARE 

Response.BinaryWrite vPDFContent 

7 i. TOOLS FOR THE 
PROFESSIONAL 
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Back to the Future 


Michael Swaine 


n writing about Basic here recently, I 

gave short shrift to the original 

Kemeney and Kurtz Basic. I also missed 

an opportunity to reveal the program- 
ming skills of Bill Gates, Paul Allen, and 
Monte Davidoff in the original Altair Ba- 
sic that they wrote back in 1975. I sub- 
sequently said a little more about K and 
K, but was frustrated on the other front 
by the fact that, along with other re- 
searchers and historians — and apparent- 
ly Bill Gates himself—I didn’t know 
where the original Altair Basic source 
code was. 

Then, in May of this year, the tireless 
reporters at the British online publication 
The Register (http://www.theregister.co.uk/) 
did an interview with Davidoff, in which 
it was revealed that the legendary Altair 
Basic source code had been found. 

So, if you'll forgive me, I'll talk about 
Basic yet again this month. But not just 
Basic. With all the noise about Microsoft's 
XP, it seems timely to remind ourselves of 
another XP, the programming discipline 
called “Extreme Programming,” recalling 
what it’s about and asking what its role is 
in software development circa 2001. So I'll 
touch on that here, too. 

Despite the recent indications that I have 
some sort of obsession with Basic, the 
truth is that if I had the courage of my 
convictions, I would do all my program- 
ming in Lisp or Prolog, languages for 
which I have an incurable academic fond- 
ness and nostalgic attachment. So I want 
to show you a neat little parallel between 
Prolog programs and some of the struc- 
tures needed for this Semantic Web thing 
that Tim Berners-Lee has been promot- 


Michael is editor-at-large for DD]. He can 
be contacted at mike@swaine.com. 
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ing. (Double congrats to TBL, BTW, who 
became an FRS this year while his WWW 
celebrated its 10th birthday.) 

I also use the Rebol language for some 
tasks, and the Rebol crew is currently pro- 
moting something called the “X Internet.” 
This is not XP (Microsoft’s or the other 
one) and not the Semantic Web, but what 
is it, exactly? I’ll try to nail that down by 
the end of the column. In a way, this is a 
backward- and forward-looking nostalgia- 
for-the-future kind of column this month, 
and where better to start the nostalgifica- 
tion than with the lost innocence of Bill 
Gates. 


Bill’s Lost Code 
Finally, after 25 years, the source code for 
the original Altair Basic has surfaced. I can’t 
show it to you here, but you can see it for 
yourself, or at least a very good copy of it, 
at Bill’s alma mater. The code was long 
sought by journalists and historians. Back 
in 1987, a French journalist, André Warus- 
fel, asked Bill Gates if Bill still had the code, 
and if so, would he make it available. Bill 
said that he did and that he would, but he 
didn’t. Apparently Bill never found the 
code, if he ever looked for it, and over the 
years others, especially David Mery, re- 
peatedly bugged him about this missing 
8080 assembly code, but to no avail. 
Then, in 1999, the code suddenly came 
to light. So back-burnered had my own 
desultory search for the lost code become 
that I didn’t even notice that it was over 
until this spring. The listing in question 
had gotten shoved down behind some 
piece of furniture in an office in Harvard’s 
Aiden Computer Laboratory back in 1975. 
This piece of furniture may have been a 
filing cabinet, or then again it may have 
been something else. The sources are in- 


Dr. Dobb’s Journal, August 2001 





consistent on exactly what kind of piece 
of furniture it was. In any case, rescued 
from this hidey-hole in 1980, it lived for- 
gotten in a filing cabinet (definitely a fil- 
ing cabinet) for 19 years, until 1999. With 
Bill’s permission, it now resides in the 
Pusey Library at Harvard. 

Actually, it’s not quite the original, if 
you go by the version number. What Har- 
ry R. Lewis, the Gordon McKay Professor 
of Computer Science and Dean of Har- 
vard College, found in that filing cabinet 
is marked with a version number 1.1. And 
what’s on display in the library is really a 
recent copy. Gates and Allen also now 
have copies, and the original is locked up 
somewhere. 

What’s there is 114 pages of code, con- 
stituting the 4K and 8K Basic source. Gates 
is credited with writing the run-time stuff, 
Paul Allen the nonrun-time stuff, and 
Monte Davidoff the math package. The 
copyright, though, is in the names of Gates 
and Allen. 

Yeah, yeah, but was Gates any good? 

So now that we can examine the ev- 
idence, what’s the verdict? Was Bill 
Gates a good programmer, or just the 
luckiest person in the world? Actually, 
we didn’t have to wait for the printout 
to answer that. Reuben Harris has dis- 
assembled an early Altair Basic binary, 
and his conclusion is that the 1975 Bill 
Gates was a true hacker in the finest 
sense of the word. 

In more than one instance, for exam- 
ple, Gates’s code jumps to the second or 
third byte of a 2- or 3-byte 8080 instruc- 
tion. What looks like an error is just a way 
to save a byte here and there. 

This is what some might call “running 
light without overbyte.” And that was 
what mattered in 1975. Dennis Allison, 


97 







A JAVA 
A (++ 
A XML 
A SOAP 
A C# 

A EJB 


presented by 


8 EVENTS 
group 





SD is all you need, 


SD Web Services World provides the most complete training for development teams: 


A Components 
A Distributed Computing 


A Collaborative Programming 
A Software Design & Process 
A Service Oriented Architecture 
A Web Services Business Imp 


Keynotes include: 


A Bjarne Stroustrup A Rene Bonvanie 
A Simon Phipps 


A Daniel Guinan 


A .NET 
A ebXML 
A WSDL 
A UDDI 
A P2P 


ementation A CORBA 


latinum Sponsors: 


FEET @ Sun 


microsystems 


| 


invent 








(continued from page 97) 
this magazine’s cofounder, once told me 
that there is only space and time, and 
you have to decide which to conserve. 
(Those are not even remotely his words, 
but I’m pulling this one out from behind 
my own mental furniture after more than 
a decade. He did say something to that 
effect.) In 1975, when speed was a lux- 
ury and saving 3 bytes might mean the 
difference between being able to imple- 
ment a feature and not being able to im- 
plement it, what you conserved was 
space. Bill was very good at this. 

As were Allen and Davidoff, I suspect. 


ring extemporaneous pitch for XP, but 
when I asked her later if she were an XP 
devotee, she denied it. She was very cu- 
rious about XP, she said, but not a devo- 
tee. I gathered that she couldn’t see how 
it fit into her work. That reaction seems 
to be common. Part of the appeal of XP 
is its programmer-centric worldview. 
Among programming methodologies, XP 
must be, at least at first look, the least ap- 
pealing to corporate bosses (unless they 
happen also to be programmers), because 
it puts the business people and the busi- 
ness interests firmly in their place, and 
elevates, or rather highlights, programmers 


extreme about it? Kent’s answer is that it 
takes obvious, common sense principles 
and practices to extreme levels. For ex- 
ample: 


e If short iterations are good, make them 
as short as possible— hours or minutes 
or seconds rather than days or weeks 
or years. 

e If simplicity is good, always do the sim- 
plest thing that could possibly work. 

e If testing is good, test all the time. Write 
the test code before you write the code 
to test. 

e If code reviews are good, review code 


continuously, by programming in pairs, 
two programmers to a computer, tak- 
ing turns looking over each other’s 
shoulders. 


and their issues. It works, though, at least 
for some people and some projects. 

But if Extreme Programming is just a 
new selection of old practices, what’s so 


And what about Davidoff? He’s got his 
own business these days, Alluvial Soft- 
ware, where he programs mostly UNIX 
systems and sings the praises of Richard 
Stallman and the other Free Software and 
Open Source Movement heroes who are 
making his old friend Bill so nervous these 
days. His favorite language is Python. 


Extreme Programming: 

An Anti-Executive Summary 

I met with Kent Beck over lunch recent- 
ly, and we did that author thing of ex- 
changing copies of our books. For what- 
ever reason, I hadn’t gotten around to 
reading Kent’s book Extreme Program- 
ming Explained, (Addison-Wesley, 2000, 
ISBN 201-61641-6), so I was grateful for 
the autographed copy and dug into it im- 
mediately. 

My previous exposure to the discipline 
of Extreme Programming had been through 
the writing of Ron Jeffries, but Kent is the 
guy who more or less invented Extreme 
Programming (XP). 

He’s also a thoughtful and entertaining 
writer. Before I knew it, I had zipped 
through two books in the Addison-Wesley 
XP series, the other being Planning Ex- 
treme Programming by Kent Beck and 
Martin Fowler (Addison-Wesley, 2001, 
ISBN 0-201-71091-9), and had refreshed 
my memory on XP. Perhaps I can do the 
same for some of you, or point others to 
Beck’s books. Kent is quick to point out 
that the ideas in XP are not his invention. 
They are good programming practices and 
principles, many of them probably as old 
as coding. Some are just good ideas in 
managing any complex task. But XP rep- 
resents a cogent rethinking of the inter- 
action of these practices and principles, 
and results in reduced project risk, better 
responsiveness to business changes, and 
greater productivity throughout the life of 
the system. 

That, anyway, is the claim. XP has its 
supporters and detractors, but it also 
evokes in people a characteristic mixed 
reaction. I saw that reaction when I par- 
ticipated in a panel discussion recently. 
One of the other panelists delivered a stir- 
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There are other principles and values 
that define XP; Kent’s book is a good place 
to learn about them. , 

XP differs from most programming 
methodologies in that it doesn’t encourage 
a lot of up-front planning. XP assumes that 
the goals will change repeatedly in the 
course of the project and emphasizes flex- 
ibility. In this, it seems much more in tune 
with the realities of software development 
in the real world— anywhere outside NASA 
anyway — than other methodologies. That 
may be another thing that attracts people 
like my friend on that panel to XP. 

What gives them pause, I suspect, is the 
team programming. It’s certainly what 
keeps me from implementing XP in my 
programming. Ain’t got no team, can’t do 
no team programming. 


ak 


SOFTWARE 


Extreme Programming 

And Open-Source Projects 

The requirement— and Kent insists that 
it is a requirement— of programming in 
pairs is particularly difficult for open- 
source projects, in which programmers 
collaborating on code may not even be 
on the same continent. But many XP prin- 
ciples and goals are shared by open- 
source development. 

Open-source projects are typified by 
short release cycles, by programmers re- 
viewing one another’s code, by an open- 
ness to patches, by programmers working 
on the parts of a project that most appeal 
to them or for which they just got an 
idea— all of which typify XP development. 

Can XP be applied to open source? 
Since open-source projects are character- 
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ized by voluntary participation, I don’t see 
how any programming methodology can 
be imposed on them. But XP principles 
can be promoted and encouraged in open- 
source projects, as they seem to fit, and 
there is nothing to prevent programmers 
working on an open-source project from 
working in pairs. 

As one developer who goes by the 
handle chromatic has pointed out (http:// 
linux.oreillynet.com/pub/a/linux/ 
2001/05/04/xp_intro.html), to the out- 
side observer both XP and open-source 
development look like pure chaos. 
“Somehow, though,” chromatic says, “or- 
der emerges, and programmers produce 
good code that meets requirements and 
meets or beats the schedule and the bud- 
get. At least, when [the methodology is] 
applied properly. The secret is knowing 
when, where, and how much.” Kent 
Beck agrees that XP isn’t for everyone, 
everywhere. “There are times and places 
and people and customers that would 
explode an XP project like a cheap bal- 
loon,” he says. “It is important not to use 
XP for those projects.” 

And he understands that some people 
will pick those aspects of XP that they like 
and skip the others, which may happen— 
may be happening—with open-source 
projects. “There are practices in XP that 
are a good idea regardless of what you 
think about the whole picture. You should 
do them.” 


Prolog, RDL, and the Semantic Web 

I talked about the Semantic Web last 
month. Its goal is to empower programs 
and machines to reason about informa- 
tion by virtue of having some under- 
standing of what that information actual- 
ly means. One of the core technologies 
necessary for the realization of the Se- 
mantic Web is RDF/RDF Schema, which 
can be thought of as a language for the 
description of things and their associated 
types. 

As Bijan Parsia wrote on April 25 over 
at xml.com, the Semantic Web is really an 
artificial intelligence project, if only the 
words “artificial intelligence” weren't poi- 
son in the public perception. And RDF 
has a strong affinity for one of the pri- 
mary languages used in artificial intelli- 
gence work, Prolog. 

Prolog programming consists of build- 
ing a knowledge base of assertions and 
querying it. An assertion, which looks a 
lot like a function, asserts some fact. (It’s 
a fact as far as the knowledge base is con- 
cerned, anyway.) 


owns(mike_swaine,moby_dick). 


is a Prolog assertion. There’s a subject- 
verb-object structure to these assertions, 
usually. In this case, mike_swaine is the 
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subject and moby_dick the object of the 


verb (technically, the functor) owns. We 
might want to be clearer about whether 
it’s a whale or a book that I own: 


owns(mike_swaine, book(moby_dick, 
herman_melville)). 


We use the following syntax to query 
the knowledge base: 


?- owns(mike_swaine, book(moby_dick, 
herman_melville)). 


This Boolean query will return True or 
False when run against the knowledge 
base consisting of our one assertion. 


?- owns(mike_swaine, Book) 


is a more interesting query because it con- 
tains a variable. (The syntax convention 
is that terms starting with a capital letter 
or underscore are variables.) Run this 
query against our knowledge base and 
you get the response 


Book = book(moby_dick,herman_melville) 


Using RDF/RDF Schema, you also find 
yourself producing three-term structures 
consisting of 


e a predicate (a URD, 
e a subject (a URD, and 
e an object (a URI). 


It’s ridiculously easy, if somewhat cum- 
bersome, to map these RDF triples to Pro- 
log assertions: 


"http://www.bookstore_database/functors/ 
owns. html" ("mail:mike@swaine.com", 
"http://www.bookstore_database/books/ 
moby_dick.html"). 


Judicious use of namespaces can declut- 
ter such an expression. 

So far this is just representation, but Pro- 
log’s power is in what it lets you do with 
these knowledge bases once you've built 
them. Prolog is the closest thing there is 
to logic implemented as a programming 
language, and it is ideally suited for de- 
ductive reasoning and proof construction, 
which are items on the To Do list for the 
Semantic Web. Prolog provides a won- 
derfully natural way of expressing facts, 
and a naturally wonderful way to imple- 
ment a logic layer on those facts, putting 
the AI in the SW. 


What's This X Internet? 

George Colony, the CEO of Forrester Re- 
search, not the doomed fisherman in 
The Perfect Storm, coined the term “X 
Internet.” It’s his buzzword for encap- 
sulating his contentious contention that 
the Web will be replaced in a couple of 
years, just as the Web itself replaced 
WAIS and Gopher, and that it will be re- 


bttp://www.ddj.com 


placed by a new paradigm of distribut- 


ed executables. 

Distributed, peer-to-peer Internet- 
working is clearly on the rise, and you’d 
have to be blind not to be aware that 
executable, live content is more and 
more with us. Nothing profound in not- 
ing these trends. Colony’s boldness lies 
in claiming that this executable Internet 
paradigm will replace the sharing of or- 
dinary content. That’s bold and, it seems 
to me, about on a par with predictions 
of the paperless office. 

Words and pictures aren’t going to go 
away just because we've figured out how 
to make refrigerators talk to DVD play- 
ers. Even if new protocols replace HTTP 
and HTML, I don’t see the investment in 


good web sites with useful content turn- 
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ing to dust under the wheels of the X 


Internet. that sald, the Bxecutable In- 
ternet is a powerful idea, if not a par- 
ticularly original one. It has a large over- 
lap with the Semantic Web, but a more 
marketable name. 

Mac developers: Next month [’ll re- 
port on some feedback from attendees 
at Apple’s Worldwide Developer Con- 
ference, which took place in May. Mean- 
while, have you discovered http://www 
.macintoshdeveloper.com/ yet? It's DDJ’s 
site for Mac programmers, and features dai- 
ly news briefs, columns, and other good- 
ies — including a column by yours truly. 


DDJ 


CODE 


YOUW’VE GOT TO BE KIDDING! 
TACKLING THREADS IS A 
NIGHTMARE WITHOUT ETNUS 
TOTALVIEW 5, THE LEADING 
DEBUGGER FOR COMPLEX 
CODE. BREAKTHROUGH 
CAPABILITIES GIVE YOU 
COMPLETE CONTROL AT THE 
THREAD LEVEL. IDENTIFY 
HARD TO FIND BUGS IN CODE 
WITH THREADS, MPI, AND 
OPENMP ON ALL MAJOR UNIX 
AND LINUX PLATFORMS. SO 
STOP KIDDING AROUND. TRY 
TOTALVIEW 5S FOR FREE AT 
WWW.ETNUS.COM/TOTALVIEWS, 
OR CALL SBODO-656-3766. 


ALVIEW-. 


ee oe Gee © 





setts aie 


101 


IMAGE 
FORMAT 
SUPPORT 


IMPORT/EXPORT 





technology and solving programmer's Ss problems hes produced the engines listed below. Visit our web site for full details on available 



































roar at LEADTOOLS products which utilize these powerful PAG HS 
-Lossy 
-Lossless 
MPEG 
JBIG 
PNG 
SGI 
PSD 
PCD 
EXF 
FPX 
EPS 
lOCA 
MODCA 
BMP 
DIB 
Ico 
CUR 
ANI 
CLP 
PCX 
DCX = 
IMG 
se Fan Import and export 60+ raster file formats - Supports a wide range of | Powerful annotation capabilities - Enable the addition of text, highlights, 
MAC compression options, bit depths (up to 64) and color spaces. Progressive sticky notes, audio, ellipses, buttons, lines, arrows, rectangles, polygons, 
TGA and non-progressive modes, multi-page, animation and non-image data. redaction, hotspots, freehand scribble, pointers, bitmaps, stamps, rulers 
RAS Image processing - Transforms- resize, rotate, linear and bicubic | 4d hyperlinks all with user-defined security features. 
ae interpolation, flip, invert, reverse, crop, underlay, shear, transpose, auto © Document image processing and clean-up- Despeckle, deskew, 
OS/2 BMP deskew and combine. Filters - sharpen, intensity, saturation, histogram, inverted text, removal of dots, blobs, lines, borders, hole punches, 
AWD posterize, median, edge, noise and more. Spacial filters - gradient, | charactersmooth. Region of interest, preview of changes and composite 
IFF laplacian, sobel, prewitt and more. viewing of the modified regions. 
ot Display - scroll, scale with interpolation, dither, contrast, brightness, witha Optimized viewing of bitonal images - Specialized display filters 
CUT choice of over 2,000 special effects. including FavorBlack and ScaleToGray, bilinear and bicubic interpolation. 
pt : Scanning, printing, imaging common dialogs, thumbnail browser, image Also includes ultra-fast CCITT G-3, G-4 and rotation, JBIG compression 
CALS list, database imaging functions, screen capture and much more! and a pan-window. 
WAN 
WPG 
WMF 
EMF 
AVI DICOM 3.0 - Supports the latest specs, including all standard lOD classes import/export - DWG, DXF, DWF, EMF, WMF, CGM, DGN, DRW, HPGL, 
FLC and modalities (CR, CT, MR, NM, US, RF, SC, VL, Worklist, etc.) and HPGL2, PICT and LEAD VEC (LEAD's proprietary file format) in native 
— DICOM directory. form. 
PNM DICOM communications protocol - COMPLETE support including all Editing - 14 different primitive object types (Vertex, line, rectangle, 
PBM Service Classes (Verification, Storage, Query/Retrieve, Patient polyline, polybezier curve, polygon, ellipse, circle, arc, text, pie, chord, 
PGM Management, etc). High-level communications functions to simplify the polydraw, and raster). Group objects into layers, copy or move objects 
se creation of DICOM client/server applications. between layers, lock individual layers. Add, edit, delete, rotate, translate 
TIE 6.0 Optimized image processing - The richest in the industry supporting Nd scale objects and layers. Convert points from world space to screen 
-Multipage 1,2,3,4,5,6,7,8,12,16,24 and 32 bit images. Includes 8-16 bit grayscale | Spaceand vice versa. Pixel accurate hit testing. 
JPEG display with window leveling and LUT support. Also includes 3-D viewing (lighting, shadow, camera), 3 vector engines 
— Includes Medical specific measuring and mark-up annotations like cross | (GDI, OpenGL and DirectX), convert or overlay vector drawings to any 
-RLE product, ruler, point and protractor. LEAD supported raster format. | 
-LZW 
~CCITT 
-CCITT G31D 
-CCITT G32D 
-CCITT G4 Support for client server development over the Internet or LAN Play, Edit and Save - Comprehensive support of multimedia formats 
set Provides a framework for sending /receiving commands (LOAD, SAVE, _ including AVI, WAV, MIDI, SND, AIF, AIFC, MPEG-1, MPEG-2 files. 
-Grayscale CREATEWIN, etc.) from one computer to another. Capture - Capture multimedia data from any Video for Windows or 
-Bitonal Control Image processing remotely - Allows for creation of "remote DirectShow capture device. 
-CMYK control" type applications where image processing can take place on a 
sacoae remote computer. 
Small COM objects and ActiveX controls - Internet enable 
VECTOR LEADTOOLS Raster imaging functionality but greatly reduce 
ba redistribution requirements. 
DXF 2-D New HTTP and FTP functionality - Provides programmatic control of 
DXF 3-D FTP & HTTP servers. Happy Anniversary, LEAQ! 
oe Upload Control - For sending files to an HTTP server. www. leadtools. com 
DWG : | | 
CGM 
DGN 
HPGL 
HPGL2 


PICT 

















C PROGRAMMING 


The Best of the 
Penguin's Quest 





Al Stevens 


t’s my anniversary again. I started writ- 

ing this column in the August issue of 

1988, 13 long years ago. Among the 

many benefits of this assignment is a 
collection built over the years of an im- 
pressive technical library. Trade journal- 
ists get lots of books and software from 
publishers who hope we'll favorably re- 
view their products and, when absolutely 
necessary, we might even buy something. 
Having limited bookshelf space, I peri- 
odically purge the shelves of the older 
stuff and donate whatever I don’t need 
any further to a local computer user’s 
group. I made such a contribution last 
week. Since I shifted the emphasis of 
this column to Linux, the books have 
been pouring in. My shelves are jam- 
packed with Windows programming 
books and software, and there is no 
room for the new Linux books that ar- 
rive every week. As I pulled down the 
stuff and boxed it up for the user’s 
group, something occurred to me. Since 
Linux applications software does not 
come in pretty, multicolored, glossy box- 
es with expensive manuals and CD- 
ROMs, we computer users won’t have 
any Linux applications to give away 
when the apps become obsolete. But 
worse, there will be no charitable con- 
tributions we can write off. Linux soft- 
ware is free. Mostly you download it 
from the Internet with the blessings of 
its altruistic authors. I wonder if these 
upstart Linux developers realize the eco- 


Al is DDJ’s senior contributing editor. He 
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nomic impact their generosity will have 
on a system of free enterprise that de- 
pends heavily on tax write-offs — de- 
preciation, contributions, and all that. 
Maybe that’s what that Microsoft execu- 
tive meant when he said that open 
source is unAmerican. 


The Editor’s Editor 

A recent column generated more mail 
than any column in the baker’s-dozen- 
year history of the “C Programming” col- 
umn. When I described my quest for the 
perfect Linux programmer’s editor and 
explained why, for me, emacs is not that 
editor, the mail just came pouring in. 
Nothing stirs a programmer’s heart more 
than a discussion of text editors, and you 
responded in kind. 

I had worried that my complaints 
about emacs might generate flames from 
its loyal users. I admitted that my prob- 
lems were because I did not understand 
and could not easily find adequate doc- 
umentation about how to configure 
emacs, which involves a text file named 
“emacs” that uses a Lisp-like script lan- 
guage. I was wrong about the flames. 
Maybe there are emacs zealots who 
would flame a foundering columnist who 
was struggling with their favorite tool, 
but those programmers must not be read- 
ing my column. Or else they think I’m 
not worth the trouble. 

Most of you who responded confessed 
having had similar difficulties with emacs 
as you learned its use. Many of you ex- 
plained how to address one or more of 
my issues. Many sent copies of your own 


Dr. Dobb's Journal, August 2001 





emacs configuration files. Interestingly, 
not one reader was able to address all 
the issues, although all the issues did get 
addressed by a combination of the mes- 
sages. Several of you reported that you 
could not find how to fix one or anoth- 
er of my problems, even after a thor- 
ough search of the online help docu- 
mentation. These were experienced 
emacs users, too. 

I won't publish all those .emacs con- 
figuration scripts here. If you need one 
of the solutions, let me know and I'll for- 
ward one of the appropriate responses. 


Nedit 

Another camp of respondents reported 
that they, too, had dismissed emacs as 
being too arcane for their tastes. These 
readers each named their favorite editor 
as something I should consider. An over- 
whelming majority supports an open- 
source editor called “nedit” (http:// 
www.nedit.org/). Nedit is included on 
the Mandrake 7.2 distribution I run on 
my laptop, so I was able to try it. This 
version (5.1.1), dated March 2000, is the 
most recent. Like the Kwrite editor I men- 
tioned before, nedit does not do word 
wrapping intuitively. Maybe a newer ver- 
sion will do it better. Word wrapping is 
not an essential feature for a program- 
mer’s editor, but I have to write prose in 
addition to code and need to wrap my 
words. This version of nedit does not in- 
tegrate well with the font setting system, 
either. I haven’t yet gotten it to set a font 
I can see well on the laptop screen. Fi- 
nally, nedit uses the Del key wrong. It 
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works the same as the BkSp key, which 
is counterintuitive. I don’t need two char- 
acter delete keys that delete the charac- 
ter to the left of the insertion point. There 
doesn’t seem to be a way to change it, 
either. Nedit looks like a full-featured 
programmer’s editor, but I’ll have to wait 
until I see a more recent version before 
I choose it for mine. 


VIM 

A few of you wrote that your editor of 
choice is VIM (which stands for “VI iM- 
proved”), Vi is the venerable old *nix 
text-mode editor from the days of yore 
and VIM runs vi in a window with oth- 
er enhancements. Those few readers 
each accompanied their confession with 
an apology as if they had been caught 
doing something naughty. Apparently it’s 
not cool to like vi and its descendants, I 
used vi quite a bit in the past. It was the 
best of two editors available at my ISP’s 
server site when I was experimenting 
with CGI programming a few years ago. 
The other editor was TECO. I don’t care 
to repeat either of those experiences, so 
I haven’t looked into VIM at all. 


Red Hat Linux 7.1 

Last month, I reported from the rural 
farmland of Pennsylvania that my exile 
was exacerbated by having to use an old 
version of Red Hat Linux on an old lap- 
top. I wanted to report about Red Hat’s 
latest distribution, Version 7.1, which in- 
cludes lots of new stuff and the latest 
Linux kernel, Version V.4. But I couldn’t 
because the release candidate would not 
install on the old laptop, which did not 
have sufficient memory and hard disk. 

This month, I’m back on the farm, but 
with my newer laptop fresh from the re- 
pair shop. It has plenty of space and I 
have the official released RH 7.1, not just 
a release candidate. Guess what? 7.1 
won't install on this newer laptop either. 
Consequently, I’m still stuck in the dark 
ages, running the 2.2 kernel under Man- 
drake 7.2. 

(Now, please don’t rush to your mail 
writer to demand to know why I am not 
using Mandrake 8.0, recently released 
with the new 2.4 kernel. I am not us- 
ing it because I don’t have it. It’s as sim- 
ple as that. We trade journalists get lots 
of freebies, but not everything comes 
our way.) 

There’s nothing exotic about my hard- 
ware, just your plain, vanilla Toshiba 
Satellite P466. But Red Hat’s installation 
program freezes consistently during the 
process where it’s asking about the 
mouse and the screen and all that. To- 
tal lockup. Reboot required. Installing in 
text mode does not fix it. The stupid setup 
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program sees a graphical video system and 
ignores your decision to run the instal- 
lation without a GUI, I had this problem 


KDevelop looks and 
feels a lot like 
Microsoft's 
Visual Studio 





with the release candidate, too, and 
hoped it would get cleared up before 
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the candidate got sworn in, but no such 
luck. It makes you wonder. Why doesn’t 
the leading Linux distributor ensure that 
their setup program works with one of 
the most popular laptops? Makes you 
wonder, too, what else might be wrong 
in Red Hat country. Read on. 

I successfully installed first the release 
candidate and then the official released 
version on my desktop at home before 
leaving on this trip, and I have the fol- 
lowing items of bad news to report. First, 
some things got broken between the re- 
lease candidate and the release. For one, 
the KDE dialer program that connects my 
modem to my ISP got flaky. It can’t seem 
to stay in sync with whether the connec- 
tion actually went through or not. It 
worked fine under the release candidate. 
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For another, the annoying behavior of ear- 
lier versions that require you to tell it to 
shut down the OS twice has returned. 
There are other problems with 7.1. I 
reported last month that Red Hat dis- 
tributed a broken, unauthorized GCC 
compiler in Version 7.0 and that people 
were upset about it. Red Hat did not cor- 
rect this situation in the new release. I 
guess it would have required them to 
admit they were wrong. Heaven forbid. 


KDevelop 

Red Hat 7.1 does include recent versions 
of the KDE GUI operating environment 
and the KDevelop IDE. This is a good 
thing, because programmers can use this 
distribution to experiment with devel- 
oping KDE, GNOME, and QT applica- 
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tions from within KDevelop. But exper- 
iment is all you should do unless you 
upgrade the compiler and recompile the 


You can’t learn 
Linux administration 
by using a cookbook 

approach 


libraries. It wouldn’t be wise to distribute 
software developed and tested with an 
unauthorized compiler. 
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KDevelop looks and feels a lot like Mi- 
crosoft’s Visual Studio. It supports de- 
velopment of programs under the vari- 
ous GUI operating environments and 
console applications, too. It has a decent 
programmer’s editor and integrated de- 
bugger and has become my Linux de- 
velopment platform of choice. 

(Being on the farm again, and not hav- 
ing KDevelop installed on this laptop, I 
am writing the following description from 
memory. If I get something wrong, be ad- 
vised that I shall wear my flame-retardant 
long johns all next month. Fire away.) 

Among KDevelop’s strong points is the 
automatic generation of scripts and con- 
figuration files necessary to integrate an 
application’s source code into the open 
source way of distribution. To distribute 
your program to all its potential users, 
you must set the code up to be built and 
installed on all the *nix platforms that 
the program should support. This pro- 
cedure involves automatic source code 
and makefile configuration scripts. 

When you install an open-source pro- 
gram, you typically start by running the 
/configure command from within the 
source-code directory followed by the 
make install command. The first com- 
mand examines the application’s con- 
figure scripts, source code, and some 
system files that cooperating platforms 
must include. If everything is in place, 
the command generates the makefile. 
The second command builds the appli- 
cation from source code to execute on 
your configuration. Along the way, the 
procedure determines whether you have 
the prerequisite libraries installed on your 
system. These processes involve utility 
programs called “autoconf” and “au- 
tomake” that operate on configuration 
scripts that the open-source developer 
provides. 

Presumably, by using KDevelop, you 
can let the IDE generate the configura- 
tion scripts for you. How nice to be able 
to ignore these abstruse details of auto- 
matic configuration, building, and in- 
stallation. But it might be a good idea to 
learn the architecture that underpins those 
procedures. We programmers like to raise 
our levels of abstraction to higher layers 
sO we can ignore underlying technical de- 
tails. That’s why we have high-level pro- 
gramming languages, information-hiding 
class libraries, IDEs, and such. But even 
though you might not want to be that 
close to the metal, sometimes you need 
to know what the metal is all about. Con- 
sider the work of a network administrator. 


Administration 

Another thing got broken in Red Hat 7.1, 

which might be because of the new 4.2 
(continued on page 109) 
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kernel. There are several Windows and 
Linux machines at my house, and the 
main desktop is also the gateway to the 
Internet. One dialup connection lets the 
other machines access the Internet. I 
used procedures from the Chris Negus 
book, Red Hat Linux 7 Bible (Hungry 
Minds, 1999), to configure the gateway 
and the other computers, which involves 
some trickery with the fwchains com- 
mand and something called “IP mas- 
querading.” The procedure worked fine 
with Red Hat 7.0 and does not work with 
7.1. Something is wrong with DNS res- 
olution that was not wrong before. 

All of which leads me to a conclusion 
I made about Linux administration. You 
can’t learn it by using a cookbook ap- 
proach. You can’t simply find a proce- 
dure that works in one particular book 
author’s configuration and expect to un- 
derstand enough about it to deal with 
things the author never encountered. In 
this case, the author was writing about 
a previous version of the kernel. Even 
so, authors of distribution-oriented books 
tend to teach the use of the GUI con- 
figuration tools that accompany the dis- 
tribution, and those tools hide the de- 
tails of the underlying architecture from 
their users. 

Linux is configured with combinations 
of internal kernel tables, configuration 
text files, and shell scripts. You manage 
internal table entries with commands 
from the command line, and you man- 
age the configuration files and scripts 
with your favorite text editor. You do, 
that is, if you know what you are doing. 
The distribution vendors, in their zeal to 
provide a dumbed-down interface for 
the typical user, and in their ultimate 
hope to make Linux a mainstream desk- 
top operating system (not in my lifetime), 
wrap the administration commands and 
file management in GUI applets. Only 
they never seem to get it right. The au- 
thors teach the applets rather than the 
underlying architecture, and the applets 
never seem to work 100 percent. Authors 
are usually limited in their experience to 
the configurations at their houses, so 
they rarely anticipate everyone’s admin- 
istrative needs. And the distribution ven- 
dors keep changing the applets in an ef- 
fort, I suppose, to get it right. The books 
are usually out of date by the time they 
hit the shelves, and even when they are 
not, you never learn quite enough to get 
the job done. 

I have been criticized for depending 
too much on books to find solutions to 
Linux problems. I should be using news- 
groups wherein, I am told, power users 
lurk, waiting for novices like me in need 
of their expert assistance. My experiences 
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with newsgroups indicate that you can 
indeed find quality help there but, like 
everything else on the Internet, you have 
to separate the expertise from the bovine 
droppings, and when you are looking 
for technical details, it’s not always easy 
to tell the difference. And, of course, you 
expose yourself to fools who similarly 
lie in wait only to flame you for not al- 
ready knowing the answer to the ques- 
tion you are asking. Unmoderated anar- 
chy has its place, I suppose, but it takes 
its toll in time wasted. 

Realizing a need to understand net- 
work administration from a lower level, 
I turned to Linux Network Administra- 
tor’s Guide, Second Edition, by Olaf 
Kirch and Terry Dawson (O'Reilly & As- 
sociates, 2000). This marvelous book is 


cces 
os 


sible An 


available online at http://www.oreilly 
.com/catalog/linag/. Being old fashioned, 
I prefer the printed version, which works 
a lot better out here in the hinterlands 
where the Internet doesn’t reach. You 
probably shouldn’t start with this book, 
but you should eventually get to it. Af- 
ter muddling your way through under- 
standing the architecture and not getting 
it right and not getting your network run- 
ning the way you want it to, and with 
only enough knowledge to get yourself 
into trouble, this book is for you. So far, 
the book has handled all my needs with 
Linux network administration. Highly 
recommended. 
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Rating Real Time: 


Design Points 
Ed Nisley 


n automotive design team can pro- 
duce a Formula One race car or a 
minivan, depending on the design 
point they’re given. Although either 
vehicle can fetch groceries, only one han- 
dles speed bumps with aplomb. Con- 
versely, power sliding doors won’t cut it 
in the next Grand Prix. 

When engineers and programmers sit 
down to design a project, they have in 
mind both a typical use and a typical user. 
That mindset, the project’s design point, 
determines what’s included and, perhaps 
more importantly, what’s left out of the fi- 
nal product. Four tires, check. A Formu- 
la One passenger seat, nope. 

As with automobiles, so too with soft- 
ware. The initial design point determines 
the project’s far-distant future, because 
fundamental design decisions make some 
subsequent changes trivial and others ex- 
ceedingly difficult. Several vendors dis- 
covered, to their evident surprise, the com- 
plexities of grafting a windowing user 
interface atop DOS on the good old IBM 
PC and its successors. Their projects met 
with mixed success, even if some survived 
long enough to encounter troubles with 
their own initial design points. 

To get a better idea of why this hap- 
pens, let’s take a look at some design 
points. We'll begin with the silicon that 
makes it all possible. 





Hardware Matters 
At the most basic level, the hardware de- 
sign point determines what you can ac- 
complish. You could, I suppose, emulate 
a 32-bit CPU using nothing but the 8085 
in that logic analyzer we met last month. 
Talk about an exercise in futility! 

The Transmeta saga provides a con- 
temporary example of how a particular 
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hardware design point plays out. You'll 
recall that Transmeta began with the in- 
tent of building an Intel-compatible CPU 
based on a VLIW (Very Long Instruction 
Word) architecture, using firmware to in- 
terpret x86 instructions and optimize the 
resulting sequence of microinstructions 
(they pronounce this “Code Morphing”). 
Their goals were to achieve roughly equiv- 
alent performance with Intel processors 
for a given CPU clock frequency, with sig- 
nificantly reduced power consumption in 
smaller and simpler hardware. 

The first version of their Crusoe chip, 
the TM3200, came as a serious surprise. 
They evidently based their hardware and 
firmware architecture on the assumption 
that most contemporary x86 software fa- 
vored 32-bit opcodes. That turned out not 
to be the case for either Windows or its 
applications (the only code that matters in 
the consumer marketplace today), which 
ran with, shall we say, serene dignity. 

The uniprocessor variant of Amdahl’s 
Law tells us that when your CPU has a 
bunch of peppy instructions with a few 
dogs mixed in, them dogs gonna getcha. 
For example, if almost all instructions run 
in one clock cycle, but a few instructions 
soak up 100 cycles just 1 percent of the 
time, the average instruction requires two 
cycles. Those rare instructions add 100 per- 
cent to your cycles-per-instruction num- 
ber and hack your performance in half! 

Which is essentially what nailed Trans- 
meta. Despite claiming that they could fix 
problems and add optimizations just by 
tweaking the x86 interpreter firmware, 
their 32-bit-specific hardware-design point 
set the TM3200’s performance. The fact 
that it remains a subpar performer indi- 
cates that firmware can only accomplish 
so much on a given hardware substrate. 

Their later TM5400 and TM5600 Crusoe 
chips embody different implementations 
of the same VLIW architecture. Despite 
their hard-won knowledge, the overall per- 
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formance remains well below expectation: 
Code Morphing evidently imposes far more 
overhead than they expected. This implies 
that the original VLIW design point doesn’t 
match up well with the realities of high- 
speed x86 instruction execution. 

Transmeta is not alone, however. If you 
take a good, hard look at the CPU chips 
and cores now promoted for embedded 
use, you'll recognize quite a few that be- 
gan life as desktop engines or x86 killers. 
After discovering how the desktop mar- 
ket values compatibility and performance 
above all other factors, the chips wound 
up in the embedded market, where they 
compete fiercely on speed and power 
consumption. 

Homework assignment: Read up on the 
history of RISC CPUs, then write an essay 
describing code compaction and explain- 
ing why it took on added significance in 
this era of unlimited RAM. Extra credit: 
Chart the history of power conservation 
measures and explain why “sleep mode” 
remains so deadly on desktop PCs. 


Kernel Concerns 

Snuggled up against the hardware lies the 
operating system. Inside the OS kernel, 
the code that governs the most funda- 
mental actions of the system, you'll find 
the original OS design point affecting all 
subsequent decisions. 

When you begin laying out an OS ker- 
nel with embedded and real-time capa- 
bilities, you inevitably favor the faster over 
the slower, the specific over the general, 
and the simple over the complex. The re- 
sulting kernel emerges both smaller and 
simpler than a general-purpose OS, with 
many functions either moved elsewhere 
or simply omitted. 

You will find that hardware registers 
and capabilities determine the size of 
buffers, the length of messages, and the 
number of status bits available. Fixed ar- 
rays and known-size buffers will replace 
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intricate chains of pointers and complex 
allocation schemes. Nearly always, speed 
and predictability trump anything else. 

In fact, for embedded systems below a 
certain level of complexity, you may de- 
cide you can get along just fine without 
a formal operating system at all. Interrupts 
can go directly to the routine that handles 
them, task dispatching reduces to an end- 
less loop inside main), and interprocess 
communications uses shared variables. It’s 
been done, even if it’s not mentioned in 
polite company. 

Although it’s hard to imagine in these 
days of gigahertz CPUs and gigabyte RAM, 
many embedded applications must pro- 
cess real-time events using only countable 
amounts of RAM and glacial clocks. Un- 
der those conditions, a true operating sys- 
tem may be an unaffordable luxury. Chill- 
ing thought, no? 

Because embedded projects span such 
a range of capabilities, with real-time sys- 
tems just a subset of those, you'll find suit- 
able OS kernels defined more by what 
they leave out than by what’s included. If 
the OS design point included everything, 
removing the kitchen sink may rip out 
some vital toilet plumbing. 

For example, even though your project 
has no need of a file system, the OS may 
assume it boots from a disk. Or it may as- 
sume virtual memory is always available 
and always pages to disk. Or it may have 
another gotcha that’s possible to work 
around after it smacks you upside the 
head at the least opportune moment. 

Better, perhaps, to start with a mini- 
mal system and add components in a 
building-block manner. If you must nit- 
comb a larger system, you generally wind 
up removing functions until it stops work- 
ing, then add that last hunk back in again. 

Or maybe not. It’s a matter of match- 
ing the OS design point with yours. 


Code Distillation 

The OS kernel is not, by itself, the entire 
OS. There are at least three additional 
layers: Capabilities added to the kernel, 
interface routines providing access to 
them, and utilities that make use of ev- 
erything else. Once again, the original 
OS design point determines how that 
code will perform. 

A key difference between embedded 
and desktop developers lies in mindset. 
When you’re writing for a system with- 
out all the modern conveniences (no sink, 
outdoor toilet), you tend to write small- 
er and tighter code because you know 
what’s vital. Desktop code may not nec- 
essarily be bigger and slower, but that’s 
the way to bet. 

Knowing that programs have access to 
essentially unlimited amounts of memory 
and fast processors definitely simplifies 
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software development at the cost of larg- 
er memory footprints, more complex OS 
internals, and substandard execution on 
anything less than current hardware. Pro- 
grams and utilities written for embedded 
applications tend to assume the con- 
verse — limited memory, slower CPUs, and 
fewer services. 

Now, while it’s difficult to come up with 
reliable with-and-without figures for some- 
thing like an OS kernel, we can compare 
near-OS code and utilities from the Linux 


Many embedded 
applications must 
process real-time 
events using only 


countable amounts 
of RAM 





arena. Keep in mind that your mileage 
will definitely vary! 

Consider this data point: The uClib run- 
time libraries sweat a statically linked ver- 
sion of the “Hello world!” Ur-program 
down to 2 percent of its usual size by 
omitting features and functions that desk- 
top programmers take for granted and 
embedded programmers rarely use. 

In general, static linking isn’t a win be- 
cause desktop programs assume the stan- 
dard shared libraries are available through 
dynamic loading. Unless you omit all those 
programs, you must keep both the libraries 
and the OS loading facilities. However, for 
embedded systems that run only a very 
few, very carefully controlled programs, 
static linking pays off. As always, it de- 
pends on your system’s design point. 

The BusyBox project provides a sec- 
ond data point. BusyBox combines the 
myriad GNU command-line utilities re- 
quired to actually get something done on 
a UNIX system into a single executable. 
That file may be a 400-KB hunk when it’s 
statically linked with uClib, but it replaces 
100-odd separate files, each weighing 
around 40 KB. 

Do the Math! 

For both uClib and BusyBox, the new 
code’s design point included “small and 
simple” and excluded “general purpose.” 
Combining similar features, reusing code, 
and eliminating seldom-used functions 
produced much of the compaction. The 
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ensuing code became smaller and faster, 
even though it shares a common heritage 
with the usual sources. 

Yes, a different design point can pro- 
duce strikingly different results! 


Reentry Checklist 

The Protean nature of software might lead 
you to believe that mutating a desktop OS 
into a real-time, embedded contender re- 
quires just a few tweaks. With the notion 
of a design point in mind, next time we'll 
examine just what’s involved. 

I'd like more fundamental knowledge 
of how the Crusoe’s hardware and firm- 
ware work, but after more than a year, 
the Transmeta web site (http://www 
.transmeta.com/) still says they’ll make the 
Crusoe hardware-designer information 
package more widely available Real Soon 
Now, and they have not yet announced 
performance numbers. My opinions are 
based on experience I’ve had with VLIW 
machines and what I’ve read here and 
there on the Web. 

Linux aficionados can get an idea of 
what an entirely different OS design point 
looks like by reading through the QNX 
manuals and white papers (http://www 
.qnx.com/). You'll see considerable cross- 
pollination above the kernel level, but 
striking differences below that line. As they 
point out, “POSIX” doesn’t necessarily 
mean “UNIX.” 

The Linux Documentation Project pro- 
vides kernel doc at http://www.linuxdoc 
.org/. The background info on tweaking 
and tuning also shows off some of the ini- 
tial kernel-design points. Of course, you 
can always peruse the kernel source itself 
from any distribution or fetch it online 
from http://www.kernel.org/. 

A graph showing the BusyBox exe- 
cutable file size against time appears at 
http://busybox.lineo.com/. In this case, 
you can’t call it “feature creep” because 
the design point specifically includes mod- 
ularity, making it easy to configure exactly 
what you need and omit features that you 
won't use. 

The uClinux project lives at http:// 
www.uclinux.org/, which leads you to the 
uClib code. 

Search on +"Seymour Cray" +"virtual 
memory" for a dissenting voice on the ben- 
efits of virtual memory. He was talking 
about supercomputers, but don’t ignore 
the key point. Great men create pithy say- 
ings; Cray was no exception. 

The IEEE provides a remailing service 
for members, so despite being a member 
for these many years, the ieee.org domain 
in my email address doesn’t imply I ei- 
ther work for or represent them. 


DDJ 
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ls JavaScript an 


Object-Oriented Language? 


Nadine McKenzie 


ecause JavaScript is not strongly typed 
and does not support class-based in- 
heritance, it is easy to dismiss it as 
@ not being an object-oriented lan- 
guage. In truth, JavaScript is an object- 
oriented language that utilizes prototype- 
based inheritance rather than class-based 
inheritance 

In addition to its inheritance capabili- 
ties, JavaScript has other, albeit underused, 
object-oriented strengths such as encap- 
sulation. The concept of encapsulation 
(that is, making an object’s data members 
private and manipulating that data using 
methods) can be easily transferred to 
JavaScript. Just as a typical Java class has 
a constructor, member variables, and meth- 
ods, this class structure can be mirrored 
with JavaScript. This is done by what is 
known in JavaScript as a “nested func- 
tion’— a function that houses one or 
many inner functions. 





function OuterMost() {...function nestedQ 


JavaScript refers to nested functions in 
this context as “methods.” For simplicity, 
I refer to the outermost function as a class. 
Listing One is an example of a JavaScript 
class called MyClassName. While the syn- 
tax is notably different, the underlying 


Nadine is an independent contractor and 


a Sun Certified Java Programmer. She can 
be contacted at nadine@nadtech.com. 
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structure achieves the same end goal: en- 
capsulation. 

In Java, there is a special kind of 
method called a “constructor,” whose pri- 
mary purpose is to allocate storage for 
the object and assign initial values to its 
properties. In JavaScript, a constructor 
also exists and even works in the same 
way, but is not as obvious when exam- 
ining the language syntax. This is because 
JavaScript does not contain a formal con- 
structor. Instead, the outermost function 
serves as a constructor. Consequently, it 
makes sense to place all initialization code 
after the property declarations (as in List- 
ing One). 

Following the property declarations and 
object initializations are the method point- 
ers. To handle scoping issues between the 
outermost function and its nested func- 
tions, method pointers are used. Bridging 
this scoping gap, in turn, lets the method 
have public access. For every nested func- 
tion that will have public access, a corre- 
sponding method pointer must exist (as 
in Listing One). Omitting the method 
pointer will make the method private. This 
comes in handy and is an important fea- 
ture of encapsulation. Also, for scoping 
reasons, the object properties are inten- 
tionally not preceded by the keyword this. 
Omitting the keyword allows them to re- 
main private. Using the this keyword only 
serves to make the object property pub- 
lic, thereby defying encapsulation. 


Dr. Dobb’s Journal, August 2001 





Listing Two shows how to create an 
object instance and call its methods. The 
HTML script tag’s src attribute Cline 3 of 
Listing Two) is used to include the con- 
tents of myclassname.js (see Listing One) 
into the HTML file. The rest of the code 
should look familiar since it closely par- 
allels object creation and method access 
in Java. This approach is ideal when cre- 
ating JavaScript widgets. But don’t stop 
there, JavaScript also pairs well with 
many design patterns. So why hasn’t this 
been available since the onset of 
JavaScript? The answer is that nested 
functions have only been around since 
JavaScript 1.2. 

Adopting this approach makes code 
easier to read, more maintainable, and 
achieves better reuse. In addition, devel- 
opers coming from an OO background 
will find the transition to JavaScript 
smoother. Be conservative, however, be- 
cause this approach can also slow things 
down when used excessively. 

Some final words. The ECMAScript 
specification, to which JavaScript must 
conform, lists future reserved keywords 
for proposed extensions. Among them 
are: class, import, super, and extends. 
Sound familiar? A glimpse of tomorrow 
hints that JavaScript may become even 
more Java-like. 


DDJ 
(Listings begin on page 116.) 
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Listing One 
/* myclassname.js */ 


function MyClassName(parameterA, parameterB) 


// --- Object Properties --- 
propertyR = "propertyR is read only."; 


// --- Object Initialization --- 
propertyA = parameterA; 
propertyB = parameterB; 


// --- Method Pointers --- 
this.getPropertyA = _getPropertyA; 


this.setPropertyA = _setPropertyA; 
this.getPropertyB = _getPropertyB; 
this.setPropertyB = _setPropertyB; 
this.getPropertyR = _getPropertyR; 


this.doSomeAction1 = _doSomeAction1; 


// --- Methods --- 
function _getPropertyA() 
{ 


return propertyA; 
function _setPropertyA (para) 
propertyA = para; 
function _getPropertyB() 
return propertyB; 
alite _setPropertyB (para) 
propertyB = para; 
function _getPropertyR() 
return propertyR; 
Pence _doSomeAction1 () 
alert (aPrivateMethod() + " " + this.getPropertyR()); 


function aPrivateMethod () 


return "aPrivateMethod() is a private method."; 


e 
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Listing Two 


<html> 
<head> 
<script src="myclassname.js" language="JavaScript"></script> 
<script language="JavaScript"> 
/ object declaration and instantiation 
var objMyClassName = null; 
objMyClassName = new MyClassName("A", "B"); 


// accessing the newly created objects methods 

alert (objMyClassName. getPropertyA()) ; // Displays "A" 
alert (objMyClassName.getPropertyB()); // Displays "B" 
objMyClassName.setPropertyA("Y") ; 
objMyClassName.setPropertyB("Z") ; 

alert (objMyClassName.getPropertyA()); // Displays "Y" 
alert (objMyClassName.getPropertyB()) ; // Displays "2" 


// Displays aPrivateMethod() is a private method. propertyR is read 


only. 


objMyClassName.doSomeActiont1 () ; 


// Displays Undefined - propertyB is private 
alert (objMyClassName.propertyB) ; 


// Fails, aPrivateMethod() is private. Netscape quietly bombs. 
alert (objMyClassName.aPrivateMethod()); 


</script> 
</head> 
<body> 
<!-- body tags here --> 
</body> 
</html> 
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Orthogonal Sequences 


William Stallings 


key element in spread spectrum 
communication is the use of a 
spreading sequence. The spread- 
ing sequence, c(t), is a sequence 
of binary digits shared by the transmit- 
ter and receiver. In the common scheme 
known as direct-sequence spread spec- 
trum (DSSS), spreading consists of mul- 
tiplying (XOR) the input data by the 
spreading sequence, where the bit rate 
of the spreading sequence is higher than 
that of the input data. When the signal 
is received, the spreading is removed by 
multiplying with the same spreading 
code, exactly synchronized with the re- 
ceived signal. 

The resulting data rate is consequently 
that of the spreading sequence. This in- 
creases the transmitted data rate and there- 
fore increases the required bandwidth. 
The redundancy of the system is also in- 
creased. The spreading codes are chosen 
so that the resulting signal is noise-like; 
therefore, there should be an approxi- 
mately equal number of ones and zeros 
in the spreading code and few or no re- 
peated patterns. 

Several things can be gained from this 
apparent waste of bandwidth: 





William is a consultant, lecturer, and au- 
thor of over a dozen books on data com- 
munications and computer networking. 
His most recent book is Wireless Com- 
munication and Networks (Prentice-Hall, 
2001). He can be contacted at http:// 
www.wiliamstallings.com/ or ws@shore.net. 
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e You can gain immunity from various 
kinds of noise and multipath distortion. 
The earliest applications of spread spec- 
trum were military, where it was used 
for its immunity to jamming. 

e It can also be used for hiding and en- 
crypting signals. Only a recipient who 
knows the spreading code can recover 
the encoded information. 


¢ Several users can independently use the 


same higher bandwidth with very little 
interference. This property is used in 
cellular telephony applications, with a 
technique known as code division mul- 
tiplexing (CDM) or code division mul- 
tiple access (CDMA). 


When spreading codes are used in a 
CDMA application, there is the further re- 
quirement of lack of correlation. When 
multiple signals are received, each spread 
with a different spreading code, the re- 
ceiver should be able to pick out any in- 
dividual signal using that signal’s spread- 
ing code. The spread signals should 
behave as if they were uncorrelated with 
each other, so that other signals will ap- 
pear as noise and not interfere with the 
despreading of a particular signal. Because 
of the high degree of redundancy pro- 
vided by the spreading operation, the de- 
spreading operation is able to cope with 
the interference of other signals in the 
same bandwidth. : 

For example, Figure 1 illustrates a sim- 
plified scheme for CDMA encoding and 
decoding. There are seven logical chan- 
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nels, all using DSSS with a spreading 
code of 8 bits. Assume that all sources 
are synchronized. If all seven sources 
transmit a data bit, in the form of an 8-bit 
sequence, the signals from all sources 
combine at the receiver so that two pos- 
itive or two negative values reinforce and 
a positive and negative value cancel. To 
decode a given channel, the receiver 
multiplies the incoming composite sig- 
nal by the spreading code for that chan- 
nel, sums the result, and assigns binary 
1 for a positive value and binary 0 for a 
negative value. In this example, the 
codes are C0=00000000; C1=01010101; 
C2=00110011,; C3=01100110,; C4=00001111; 
C5=01011010; C6=00111100. It is more 
convenient here to use +1 and —-1 to rep- 
resent the two binary digits 1 and 0, as 
in Figure 1. 

Two general categories of spreading 
sequences have been used: PN (pseudo- 
noise or pseudorandom number) se- 
quences and orthogonal codes. PN se- 
quences are the most common ones used 
in FHSS systems and DSSS systems not 
employing CDMA. In DSSS CDMA sys- 
tems, both PN and orthogonal codes have 
been used. In this article, I'll examine or- 
thogonal sequences. 


Orthogonal Codes 

Unlike PN sequences, an orthogonal code 
is a set of sequences in which all pairwise 
cross correlations are zero. An orthogo- 
nal set of sequences is characterized by 
the equality in Example 1(a), where / is 
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the length of each of the sequences in the 
set, >; and 6; are the ith and jth members 
of the set, and ¢ is the bit duration. 

Both fixed- and variable-length or- 
thogonal codes have been used in 
CDMA systems. For the CDMA applica- 
tion, each mobile user uses one of the 
sequences in the set as a spreading code, 
providing zero cross correlation among 
all users. 


Walsh Codes 

Walsh codes are the most common or- 
thogonal codes used in CDMA applica- 
tions. A set of Walsh codes of length n 
consists of the n rows of an nxn Walsh 
matrix. The matrix is defined recursively 
as in Example 1(b), where 7 is the di- 
mension of the matrix and the overscore 
denotes the logical NOT of the bits in the 
matrix. The Walsh matrix has the proper- 
ty that every row is orthogonal to every 
other row and to the logical NOT of ev- 
ery other row. 

Figure 2 shows the Walsh matrices of 
dimensions 2, 4, and 8. It is easily seen 
that a bitwise multiplication of any two 
rows produces 0. For example, in the 8x8 
matrix, row 3 multiplied by row 4 equals L_ 
1+(—1)+1+(-1)+1+(-D+1+(-1)=0. The ex- Example 1: (a) An orthogonal set of : 
ample of Figure 1 uses Walsh codes from sequences; (b) Walsh codes; Written by 
the 8x8 matrix. (c) variable-length orthogonal codes. 
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Figure 1: Seven-channel CDMA encoding and decoding. 
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Figure 2: Walsh matrices. 


bttp://www.ddj.com Dr. Dobb’s Journal, August 2001 119 


Orthogonal spreading codes such as the 
Walsh sequences can only be used if all of 
the users in the same CDMA channel are 
synchronized to the accuracy of a small frac- 
tion of one chip. Because the cross corre- 
lation between different shifts of Walsh se- 
quences is not zero, if tight synchronization 
is not provided, PN sequences are needed. 


S : CQ=H11—11) 
a 7 alg a(8)= 1, 1, A, - 1- 1-14 Le / . 


Variable-Length 

Orthogonal Codes 

Third- generation mobile CDMA systems 
are designed to support users at a num- 
ber of different data rates. Thus, effective 
support can be provided by using spread- 
ing codes at different rates while main- 
taining orthogonality. Suppose that the 


“othe of A a os - ot aD 
\CgQ= 111,114, a « 


- C,(8)= AA tae 141} _ 
| i OgQiet-t—1.44) _ 


AC 1.114.141 =) ee 
& B=, 1, “1, i. “1, 1, -1 1. 
| C,(6)=(-1, 1 af, 44 4 1 a 
Oy(TA.At 1 aa 1 th 








Figure 3: Generation of variable-length orthogonal codes. 
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minimum data rate to be supported is Ryn 
and that all other data rates are related by 
powers of 2. If a spreading sequence of 
length N is used for the R,,;, data rate, 
such that each bit of data is spread by 
N=2" bits of the spreading sequence 
(transmit the sequence for data bit 0; trans- 
mit the complement of the sequence for 
data bit 1), then the transmitted data rate 
is NR»min. For a data rate of 2R, jn, a 
spreading sequence of length N/2=2”! 
will produce the same output rate of 
NXRmin. In general, a code length of 2”- 
is needed for a bit rate of 2’R, nin. 

A set of variable-length orthogonal se- 
quences is readily generated from Walsh 
matrices of different dimensions. Define the 
NXN matrix Cy as consisting of N rows, 
each of length N, labeled C){1), Cy(2), ..., 
Cy(N). With the limitation that NV is a pow- 
er of 2, the matrix Cy can be recursively 
generated from the matrix Ch as in Ex- 
ample 1(c). 

Starting with the 1x1 Walsh matrix {—1}, 
Figure 3 shows the results through Cg. 
Note that each matrix in Figure 3 contains 
all of the rows of the corresponding Walsh 
matrix, but in a different order. This or- 
dering is significant for the following rea- 
son. Each row C(t) of Cy has the fol- 
lowing properties: 


¢ Cy(i) is orthogonal to all other rows 
of Gn: 

¢ Cy(V) is not orthogonal to a row on the 
same path of the tree as itself. 

¢ C\(V is orthogonal to all other rows. 


For example, Cg(4) is not orthogonal to 
C42), C201), C67), Cy6(8), or any other 
row that appears on the same tree path as 
itself. If a channel requires a mixture of 
spreading codes to accommodate different 
data rates, the selection must be made to 
avoid rows that are on the same tree path. 


Multiple Spreading 

When sufficient bandwidth is available, 
a multiple spreading technique can prove 
highly effective. A typical approach is to 
spread the data rate by an orthogonal 
code to provide mutual orthogonality 
among all users in the same cell and to 
further spread the result by a PN se- 
quence to provide mutual randomness 
(low cross correlation) between users in 
different cells. In such a two-stage 
spreading, the orthogonal codes are re- 
ferred to as “channelization codes,” and 
the PN codes are referred to as “scram- 
bling codes.” For example, this technique 
is used in the IS-95 Standard. IS-95 is the 
most widely used second- generation 
CDMA scheme, primarily deployed in 
North America. 
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DR. ECCO’S OMNIHEURIST CORNER 


Perimeters 


Dennis E. Shasha 


Ithough he was two decades too 
old to fly fighter jets, the Air Force 
General Chris Warren had the con- 
fident smile and aviator glasses of 
a test pilot. 

“We’ve got a base to protect and we 
mean to do it,” he said. “Sadly, even the 
best guards get bored, so we need sev- 
eral of them. Please look at this map.” 

He laid a piece of paper on a table and 
drew a big square. 

“Our base is a square 76 meters on a 
side on a flat desert,” he said. “Our most 
sensitive equipment (I can’t tell you what 
it is) will be kept in a depot underground 
having a surface footprint of just a few 
square meters. We want the guards to alert 
Our antiterrorist team if anyone approaches 
the depot. I pity the poor souls who get 
their attention. 

“We have five guard towers to put up. 
From each guard tower, a guard can reli- 
ably see 18 meters. But each guard isn’t 
looking everywhere all at once, of course, 
so we'll want several to cover some areas 
near the depot. We want the guards to be 
able to yell to one another, but not have 
such easy conversation that they forget 
what they are doing. So, the towers should 
be at least six meters apart. 

“We want at least 2900 square meters 
to be visible to one or more guards 





Dennis, a professor of computer science at 
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Puzzling Adventures of Dr. Ecco (Dover, 
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Principled Approach (Prentice Hall, 1992), 
(coauthored with Jason Wang and Bruce 
Shapiro) Pattern Discovery in Biomolecular 
Data: Tools, Techniques, and Applications 
Oxford University Press, 1999); and (coau- 
thored with Cathy Lazere) Out of Their 
Minds: The Lives and Discoveries of 15 Great 
Computer Scientists (Springer Verlag, 1998). 
He can be contacted at DrEcco@ddj.com. 
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(though different square meters can be 
visible to different guards). At least 775 
should be visible to three guards or more. 
(The original 2900 will include these 775, 
of course.) At least 100 should be visible 
to four or five guards.” 

“What does it mean for a meter to be 
visible? The whole meter, or just some por- 
tion?” Liane asked. 

“For the purposes of this mission,” the 
general answered, surprised by the 13- 
year-old’s question, “assume that each 
guard stands in the middle of a square 
meter (in a tower) and that seeing anoth- 
er square meter means seeing the middle 
of that square meter. 

“Your assignment is to tell me where 
the towers should be and tell me how 
many meters are visible and by how many 
guards. If you can do much better than 
my specification, I’d be grateful.” 

Liane and Ecco found a design to sat- 
isfy the general’s desires, but weren’t sure 
they were doing as well as possible. 


Reader: See how well you can do. 


Last Month’s Solution 

These are the groupings of front frag- 
ments (groups separated by commas): 18 
11 20, 3.12 29, 16,19 13, 10.6 17 8:5,.14 
41517. 

These are the groupings of back frag- 
ments: 5 6, 11 16 9 17 10, 1 14, 4 13 15, 8 
2,37 12. 

The lengths are respectively: 5.2, 11.3, 
4.7.9, 11.4: 8:1: 

This is at least consistent with the an- 
cient texts. 


Reader Notes 

Several clever readers matched Liane’s so- 
lution to the “Beats” puzzle (DD/, May 
2001) when there were three police cars. 
Most of these went on to show that all five 
cars could follow one another in a circuit 
over all of the roads so some car would 
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pass each block every 10 minutes: Richard 
W. Lipp, Tim Chase, Dennis Yelle, Pike 
Enz, Andrew D. Todd, John A. Trono, Rod- 
ney Meyer, Mike Bandy, James H. Puttick, 
Steve Kempf, Charles Doumar, Tyler Fol- 
som, Craig Ewert, Paul Valckenaers, Matt 
Cuba, Geoffrey Probert, Jimmy Hu, Mike 
Ball, Steven M. Gollmer, Patrick Gaffney, 
Robert Murr, Mike Wilson, M. Alan Megh- 
rigian, Dennis L. Cumro, Magne Ostlyngen, 
Robert Huff, Robert H. Morrison, Girts Folk- 
manis, Robert Bigec, Steven Augart, Denis 
Birnie, Wim Nuij, Mike Larsen, Norman 
Roundy, and Rob Holstein. 

Pike Enz was the first to show that ev- 
ery street could be hit every 13 minutes 
or less by the same car. Here was his so- 
lution: 


“Identifying vertices by x-y-coordinates with 
the origin in the lower left corner: 


“1) an 8-minute circuit: 
(1,4) (0,4) (0,3) (0,2) (1,2) (2,2) (2,3) (2,4) 
and back to (1,4) 


“2) an 8-minute circuit: 
(2,2) (2,) 2,1) G,0) (2,0) 2,1) B,D G,2) 
and back to (2,2) 


“3) a 9-minute circuit: 

a (3,0) (4,0) (4, (4,2) (4,3) 3,3) G,2) 
2 

and back to (3,1) 


“4) a 12-minute circuit: 

(1,3) (2,3) G,3) B,® G,5) G,6) G,7) (2,7) 
(2,6) (1,6) 

(1,5) 1,4) 

and back to (1,3) 


“S) a 13-minute circuit: 

(2,4) 3,4) (4,4) (4,5) (4,6) G,6) (2,6) (1,7) 
(1,6) (0,6) 

(0,5) (1,5) (2,5) 

and back to (2,4).” 
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PROGRAMMER’S BOOKSHELF 


A G++ Duo 


Peter Roth 


hese days, software producers no 

longer print kilopages of documenta- 

tion, but instead concentrate their ef- 

forts on online help. Given the 
breadth that software assumes these days, 
this is a reasonable approach. 

Borland and RogueWave, for instance, 
produce the online help files for the Bor- 
land C++Builder toolset (RogueWave pro- 
vided the STL for this version of BCB). The 
help files are comprehensive, and as far 
as I can tell, completely subsume any print- 
ed documentation. However, help files are 
not a book, with everything that implies. 

Product-related books are a form of 
printed documentation produced by third 
parties who, while not directly responsi- 
ble for producing the software, are usu- 
ally given access to the product prior to 
its release so the product and book 
launches occur nearly simultaneously. 

C++Builder 5 Developer’s Guide is one 
of few Borland C++Builder books avail- 
able at your bookseller, but before I say 
more about it, let me briefly introduce Ver- 
sion 5 of Borland C++Builder itself. 

Borland C++Builder (BCB) is the inte- 
grated development environment and 
toolset Borland markets for C++ develop- 
ers writing for Windows. The compiler is 
available alone as a free download from 
the Borland web site. This is the command- 
line compiler; you may also download the 
Turbo Debugger, but no other tool ac- 
companies the download. 

BCB was built to make Windows pro- 
gramming easier. It uses the Borland Del- 
phi Visual Component Library (VCL), a 
framework that wraps the Windows API 
in a far more facile way than does MFC. 
The VCL is written in Object Pascal, the 


Peter is president of Engineering Objects In- 
ternational, producers of commercial and 
custom Delphi and C++ components. He can 
be contacted at http://engineeringobjects.com/ 
and peteroth@erols.com. 
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eae of Borland’ S Delphi eae 
Hence, BCB is polyglot, compiling C, C++, 
and Object Pascal. 

Borland has added a few extensions to 
the C++ language to make the use of the 
VCL easier. I use BCB to produce what I 
imagine to be standard C++ because I only 
use the extensions when accessing the VCL. 

C++Builder 5 Developer’s Guide, by Jar- 
rod Hollingworth, Dan Butterfield (edi- 
tor), Bob Swart, and Jamie Allsop, has 
about 30 contributors, some of whom had 
about two years of C++ experience at the 
time of writing. The text is published with 
a CD-ROM, which contains the complete 
text of the book, a trial version of BCB 5, 
source-code examples, and the usual 
third-party promotional material. 

Unfortunately, there are differences 
between the printed page and the CD- 
ROM version. In the print version, the 
front matter is labeled with Roman nu- 
merals, for instance, whereas the CD ver- 
sion counts pages as if the book start- 
ed numbering all pages in Arabic. 
Hence, all the CD page numbers differ 
from the printed page by at least 42. Re- 
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ferring from one to the other is not 
straightforward. Further, there is no ac- 
tive index in the CD version, so the only 
way to find a topic is by using the 
Adobe Acrobat Reader find command. 
However, the second printing of the 
book included a number of corrections 
for the first printing, including modifi- 
cations to the CD. For example, the nav- 
igation pane of the second printing has 
hyperlinks, letting you find a particular 
chapter or topic. 

The C++ used in C++Builder 5 Devel- 
oper’s Guide is “a better C.” There’s not 
much devoted to templates, and the 15- 
page discussion of the Standard C++ Li- 
brary appears on the CD-ROM only. I ad- 
vise skipping the chapter, as it is incorrect 
on specifics and too shallow to do the 
subject justice. 

Code quality is uneven, but that is to 
be expected in a work of this size, with 
a variety of experience among the authors. 

In some cases— and I never thought 
I’'d see myself write this— there are too 
many comments: Classes end with /;//The- 
ClassName, each member ends with 
}//memberName, and so on. This is just 
clutter. Sometimes, class membership is 
denoted by a leading underscore, a prac- 
tice best avoided because it reduces po- 
tential clashes with the Standard Library. 
There is some bizarre formatting, a mix 
of tabs and spaces. You must have your 
tab stops set at 4, or else the sample code 
will be a pain to read. 

BCB supports the use of more tech- 
nologies than I will ever use. C++Builder 
5 Developer’s Guide is reflective of the 
product, and given finite time, there is no 
way to intelligently comment on every- 
thing. Having used Borland tools consis- 
tently since Turbo Pascal 2, I skimmed the 
chapters on the IDE, debugging, and 
building components, but these chapters 
will be of interest to someone just pick- 
ing up the product. 
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Other areas addressed: COM, DCOM, 
COM+; communications, web and database 
programming; the use of Borland’s MIDAS 
multitier product; ActiveX and CORBA tech- 
nologies; and integrating applications with 
Microsoft Office. Given the page limitations 
and the number of topics covered, you 
should keep the “Guide” aspect of the ti- 
tle in mind; coverage is wide, but not deep. 

The presentations I found most inter- 
esting were: 


e TeamSource, the source-code control 
system used internally at Borland. 

e Software distribution: what to ship, how 
to sell, where to advertise, and so on, 
in Chapter 28 (CD). 

e Tips and tricks, Chapter 31 (CD) had a 
discussion of floating-point math that 
should be required reading for all new- 
bies. It might minimize the newsgroup 
posts entitled “Why is 1/3+1/3+1/3!=1?” 

e A real-world example, for engineers, in 
Chapter 31 (CD). I would have put this 
at the beginning of the book because it 
demonstrates how easily a Windows app 
can be generated with BCB. 


Appendix A is a convenient and useful 
compendium of reference material: web 
sites, books, magazines, newsgroups, and 
the like, related to C++ and BCB. 

All in all, C++Builder 5 Developer's Guide 
is an uneven book, but it’s the first one I’d 
recommend when you need a quick intro 
to a specific technology using BCB. 

The first sentence of the Preface in C++ 
and Object-Oriented Numeric Computing 
for Scientists and Engineers, by Daoqi 
Yang, declares the aim of the book to be 
an easy and concise introduction to C++. 
Unfortunately, the text falls short. The first 
four tedious chapters discuss the minuti- 
ae of the language syntax. There is real- 
ly nothing specific to C++ here, and the 
text could easily apply to any procedural 
language with a C syntax: “Here’s how to 
iterate, let’s use iteration to solve a nu- 
merical problem,” and so on. 

Chapter 5 introduces classes, and the 
book becomes more C++ specific at this 
point. The class concept is used to orga- 
nize the solution of ordinary differential 
equations using the Euler and Runge- 
Kutta methods. But the introduction to 
classes is very brief, probably too brief. 
And the constructor initializer list is not 
mentioned. 

In Chapter 6, operator overloading is 
demonstrated with a small complex num- 
ber class followed by vector and matrix 
classes, and the algorithm and C++ code 
for the conjugate gradient solution of si- 
multaneous equations is introduced. Mem- 
ory management in the classes is based 
on a C pointer-to-double rather than on 
an STL vector or valarray. The chapter 
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also includes a section on the efficiency 
gained by delaying evaluation, similar to 
the use of expression templates. 

Up to this point, the example code has 
used a single error function to report pro- 
gram errors and terminate. In Chapter 7, 
the error processing changes from a func- 
tion call to the printing of a message and 
continuing (not stopping)! It seems to me 
that this code could not have been tested 
with anything other than “good” data. 


The class concept ts 
used to organize the 
solution of ordinary 
differential 
equations using the 
Euler and 
Runge-Kutta 
methods 





valarray is described, but not used any- 
where. (I would expect numeric work to 
use valarray more heavily, although I must 
admit to seeing very little written about 
it.) The chapter closes with some inter- 
esting work using expression templates 
that continues the discussion of delayed 
evaluation. The idea is to minimize the 
number of memory fetches and the 
number of temporaries. Good stuff, but I 
would think it more “advanced” than “in- 
troductory.” 

Class inheritance is introduced as an- 
other organizing principle in Chapter 8. A 
discussion of multiple inheritance is also 
included, which I believe is too much in- 
heritance for an introductory text. Full, 
banded, and symmetric matrices are de- 
rived from the earlier matrix class. 

And then the “concise” aim stated in 
the Preface seems to take over. Chapter 9 
on exception handling is too short, at 12 
pages. Exceptions are said to apply to “li- 
brary writers,” which means that students 
will tend to skip the whole idea of ex- 
ceptions. Chapter 10 on containers and al- 
gorithms is too brief an introduction to 
the STL. In Chapter 11, the matrix library 
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is redefined and incorporates solvers for 
various matrix forms. 

There is little to no discussion of the 
problems inherent in floating-point com- 
putations, and there is no error analysis 
of the computations. Some way to esti- 
mate the error of each calculation would 
be preferable. 

Although C++ and Object-Oriented Nu- 
meric Computing for Scientists and Engi- 
neers has a 2001 copyright date, the code 
“looks old,” which probably reflects the 
lag time in writing the book. 

Having cursed make on several systems, 
the line “the make utility...provides easy 
management of compiling source files...” 
brought back memories. Certainly, I would 
not expect many difficulties from make if 
I were only dealing with the few files in- 
volved in the examples. 

There are also several irritations: 


e Inconsistent use of <«xx> and <xxx.h>. 

e Use of j++ where ++/ is apropos. 

e Using namespace std;. 

e The early introduction (Chapter 2) of 
the goto. 

e STL containers and algorithms ignored 
for the most part. 

e Libraries of code are developed, yet they 
fail to use exceptions and use an error 
function instead. 


You might argue, “Yes, these are irri- 
tating, but many, if not most, programs 
written by engineers are short, one-off 
pieces designed to get an answer.” How- 
ever, I’ve had to deal with too many one- 
off engineering programs that became 
zombies: They should’ve been buried long 
ago, but they just would not die! The stu- 
dent learns from and forever emulates a 


poor style; later learning is hobbled be- 


cause the bad way “works,” and more 
zombie programs are unleashed. Argh! 

If you are not familiar with C++, learn 
the language from Accelerated C++: Prac- 
tical Programming by Example, by Andrew 
Koenig and Barbara E. Moo (Addison- 
Wesley, 2000). 

If you are already familiar with C++ 
and need some numerical algorithms, the 
appropriate sections of C++ and Object- 
Oriented Numeric Computing for Scien- 
tists and Engineers may be worth the 
price. I, for example, enjoyed the dis- 
cussions of delayed evaluation and the 
conjugate gradient method. However, one 
final caveat: You might be tempted to 
simply download the code from the au- 
thor’s web site (http://www.math.wayne 
.edu/~yang/) and plug it into a program. 
But if you do so, I suggest you rewrite 
most of it, at least to make it sturdy. 


DDJ 
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OF INTEREST 








CodeWeavers’ CrossOver Plugin lets Lin- 
ux Operating systems running on portable 
and handheld devices access Windows 
plug-ins such as Quicktime and Shock- 
wave Director. The CrossOver Plugin 
works with Netscape, Mozilla, or Kon- 
queror, and does not require any Microsoft 
licensing fees. The program relies upon 
Wine technology (a Windows API that 
runs on Linux) to enable Windows plug- 
ins to run correctly. Other products avail- 
able or forthcoming from CodeWeavers 
include a remote start menu to run PC ap- 
plications from a web pad, and a server 
to host Windows applications for display 
on multiple devices. 

CodeWeavers Inc. 

2550 University Avenue West, Suite 313N 

St. Paul, MN 55114 

800-596-0409 

http://www.codeweavers.com/ 


COM Explorer 2.0 from 4Developers lets 
you explore, manage, and repair Ac- 
tiveX/COM files. The software provides 
a centralized view of COM objects as well 
as detailed information such as GUID, 
TypeLib, version and file information. 
Misbehaved objects (such as objects with 
missing files left as junk in the Registry) 
are clearly marked. A Registry editor is 
fully integrated, allowing developers to 
view and edit COM settings. Also in- 
cluded is a dependency viewer that al- 
lows viewing of all DLL dependencies for 
a COM object and clearly marks missing 
DLL files. COM Explorer is tightly inte- 
grated with Windows Explorer, allowing 
users to quickly register, unregister, or 
access COM controls. It runs on all Win- 
dows systems. 

4Developers LLC 

1030 East El Camino Real, #440 

Sunnyvale, CA 94087 

http://www.4dev.com/ 


Kabira has announced the release of Kabi- 
ra Adapter Factory, which automatically 
generates adapters connecting the Kabira 
ObjectSwitch Server with applications or 
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network equipment via XML, Java, and 
CORBA interfaces. The tool lets you au- 
togenerate adapters to homegrown appli- 
cations and/or a multivendor network 
equipment environment. It also provides 
access to each application’s complete API 
or XML schema. 

Kabira Technologies Inc. 

One McInnis Parkway 

San Rafael, CA 94903 

415-446-5000 

http://www.kabira.com/ 


Borland has released JBuilder 5 and Del- 
phi 6. JBuilder 5, a cross-platform Java de- 
velopment environment, includes inte- 
grated support for XML, an improved tool 
for creation of dynamic data-driven web 
applications, an improved Entity Bean 
Modeler for distributed applications, and 
integration with a number of third-party 
tools and servers. Delphi 6, a rapid ap- 
plication development environment for 
Windows, has been reworked to support 
web services. The new compiler features 
native XML and SOAP capabilities, and an 
included web page designer offers serv- 
er scripting support. Linux developers us- 
ing Kylix can port to Windows through a 
component library common to both Kylix 
and Delphi. 

Borland Software Corp. 

100 Enterprise Way 

Scotts Valley, CA 95066-3249 

831-431-1000 

http://www.borland.com/ 


Century Software Embedded Technologies 
has announced Version 1.1 of its operat- 
ing environment and development toolkit 
for embedded Linux. New wireless ap- 
plications and multimedia are featured 
along with an enhanced toolkit. Version 
1.1 supports wireless connectivity on plat- 
forms including the StrongARM-based 
Compaq iPAQ and ADS Graphics Client, 
National Semiconductor x86 Geode, and 
NEC MIPS Harrier. A graphical network 
configuration utility allows configuration 
of serial PPP, modem, Ethernet, or wire- 
less TCP/IP connections. A new e-mail 
client and enhanced ViewML browser are 
included along with a PIM suite and MP3 
and MPEG 1 players. The SDK includes 
cross-development libraries, an automat- 
ed build platform, source code, and doc- 
umentation. 

Century Software Embedded Technologies 

5284 Commerce Drive, Suite C-134 

Salt Lake City, UT 84017 

801-268-3088 

http://embedded.centurysoftware.com/ 


Jungo has released WinDriver and Ker- 


nelDriver 5.0, which offer APIs for hard- 
ware access and driver code generation 
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capabilities. The graphical wizard now 
supports Linux and Solaris systems as 
well as Windows platforms. Version 5.0 
also adds Remote WinDriver, for ac- 
cessing hardware on remote machines, 
including embedded systems, via any 
network connection. Hardware in the re- 
mote machine (including USB devices) 
can be autodetected, tested, diagnosed, 
and driver source code for the target de- 
vice can be automatically generated. 

Jungo Ltd. 

6 Hagavish Street 

P.O. Box 8493 

Netanya 42504 

Israel 

877-514-0537 

http://www.jungo.com/ 


Four J’s has unveiled its XML Publication 
System, an XML report generator designed 
to transparently integrate various objects 
such as relational data, text, spreadsheets, 
graphics, and bitmaps. It lets applications 
extract data directly from any standard 
database and present it in a variety of for- 
mats. Key features of the XML Publication 
System include its thin client architecture; 
XML-based markup language; barcode 
support (UPC and EAN); multilingual sup- 
port (UFT-8, UFT-16, Multibyte, and Uni- 
code); streaming input via stdin, sockets, 
file, and HTTP; ASCII, GDI, and Postscript 
support; pluggable user-supplied render- 
ing objects (COM or JavaBeans); and 
bitmap support for various formats. The 
report generator runs on UNIX and Win- 
dows platforms. 

Four J’s America Inc. 

320 Andover Park East, Suite 210 

Seattle, WA 98118 

800-527-2811 

http://www.4jsamerica.com/ 


InstallAnywhere 4 from Zero G builds 
multiplatform installers capable of de- 
ploying and configuring software to near- 
ly any client or server platform. Many 
platform-specific installation details can 
be handled automatically, and the utility 
can install RPMs, manage and install Win- 
dows NT services, or run an installer in 
a silent or console mode. New features 
include increased platform support, 
greater installer flexibility, enhancements 
to the API, improved installer optimiza- 
tion, and a streamlined interface. 

LEGG 

514 Bryant Street 

San Francisco, CA 94107 

415-512-7771 

http://www.zerog.com/ 


Precise Software Technologies has an- 


nounced RT/Studio, an integrated devel- 
opment platform for wireless, networking, 
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and telecommunications systems. The stu- 
dio includes a POSIX-like RTOS with task 
scheduling, memory management and I/O 
features, as well as an embedded Internet 
stack with 31 modular protocol compo- 
nents (optional protocols are also avail- 
able). The RT/Studio IDE offers an inte- 
grated editor and project management 
utilities. The RT/Studio Toolbox features a 
graphical application builder, resource mon- 
itor, and performance analyzer, as well as 
simulators and the compiler. Task-aware 
and source-level debugging are imple- 
mented through documented APIs, and re- 
mote debugging and heterogeneous mul- 
tiprocessor systems are also supported. 
RT/Studio runs on Windows NT/2000. 

Precise Software Technologies Inc. 

310 Moodie Drive, Suite 308 

Nepean ON 

Canada, K2H 9C4 

613-596-2251 

http://www.psti.com/ 


Cape Clear Software has announced Cape 
Connect Two for Java 2, Enterprise Edi- 
tion (J2EE), a web services platform for 
J2EE and Enterprise JavaBeans (EJB). It 
enables the creation of web services on 
the Internet that link Java, EJB, CORBA, 
and Microsoft .NET-based applications. It 
also offers compatibility with Microsoft 
SOAP Toolkit 2.0. Cape Connect is in- 
tended to let you build web services with- 
out the need to understand enterprise 
technologies such as EJB or J2EE, or XML 
technologies such as SOAP and WSDL. It 
is available for Windows NT/2000, Sun So- 
laris, and Linux. 

Cape Clear Software 

2121 North California Boulevard, Suite 290 

Walnut Creek, CA 94596 

925-974-3311 

http://www.capeclear.com/ 


Pencel has announced Kinectivity Studio, 
a database application tool designed to 
enable custom development of solutions 
for delivery of data on mobile devices. 
Applications created with Kinectivity Stu- 
dio are deployed on Kinectivity, Pencel’s 
mobile extended relational database plat- 
form. Kinectivity Studio provides a define- 
and-deploy system, allowing the creation 
of custom mobile applications without the 
requirement of specialized development 
resources. The system provides the con- 
figurable components most often needed 
for mobile database applications includ- 
ing user interface, data validation, and data 
exchange connections. Access is also pro- 
vided to RoverScript, a scripting language 
based on JavaScript. Kinectivity currently 
supports devices based on the Palm op- 
erating system with plans to expand to 
Microsoft’s Pocket PC platform and RIM 
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Blackberry devices. Kinectivity Studio sup- 
ports Windows 9x/NT/2000. 
Pencel Corp. 
7304 West 130th Street, Suite 190 
Overland Park, KS 66213 
913-814-8511 
http://www.pencel.com/ 


MontaVista Software has announced the 
release of Hard Hat Linux 2.0, based en- 
tirely on the latest Linux 2.4 kernel. Hard 
Hat Linux 2.0 focuses on cross develop- 
ment, with tools available on eight host 
environments, including Red Hat Linux, 
Yellow Dog Linux, SuSE, Mandrake, Tur- 
boLinux, Solaris, native/hosted Hard Hat 
Linux, and VMWare on Windows NT/2000. 
It also features more than 200 software 
packages, and a suite of development tools 
designed to streamline development, de- 
bugging, and deployment of embedded 
applications. 

MontaVista Software 

1237 E. Arques Avenue 

Sunnyvale, CA 94085 

408-328-9200 

http://www.mvista.com/ 


RelexUS has released Linter Application 
Builder (LAB), a rapid application devel- 
opment tool for developing portable GUI 
database applications. LAB offers a report 
generator engine that includes flexible pa- 
rameters for the creation of customized re- 
ports; its component-based architecture is 
designed to be open and extensible. Cur- 
rently, LAB interfaces with RelexUS’s 
RDBMS Linter SQL databases, which run 
on more than 20 operating systems. Inter- 
faces to JDBC databases are planned. 

RelexUS 

10982 W. Crestline Drive 

Littleton, CO 80127 

303-904-4937 

http://www.relexus.com/ 


Infragistics has announced the release of 
.NET-compatible components Ultra- 
ServerTree and UltraListBar, a server-side 
Web Form and client-side Windows Form, 
respectively. UltraServerTree is a hierar- 
chical tree and menu rendering control for 
both DHTML and HTML 3.2 browsers. Vi- 
sual design features are included for con- 
structing, saving, and rendering trees and 
menus to HTML using predefined, cus- 
tomizable templates. UltraListBar is a nav- 
igation control that brings the look of Mi- 
crosoft Outlook to applications, allowing 
users to group and categorize a variety of 
items for quick access. 

Infragistics Corp. 

2540 Route 130 

Cranbury, NJ 08512 

800-231-8588 

http://www. infragistics.com/ 
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Tech Soft America has announced the 
HOOPS 3D Application Framework 
(HOOPS/3dAF) 6.0, an extensible, high- 
level component intended to simplify 
the design, development, and mainte- 
nance of interactive 2D or 3D applica- 
tions. This release includes new 
lightweight library options; CAE en- 
hancements for graphics performance; 
optimizations for rendering, especially 
with the OpenGL pipeline; support for 
environment mapping; support for mul- 
tiline text and text greeking; Python lan- 
guage bindings; and additions to the 
Model/View library. Also included is a 
new module supporting collaborative 
functionality. The new version runs on 
most major operating systems. 

Tech Soft America 

1830 Embarcadero, Suite 103 

Oakland, CA 94606 

510-434-7630 

http://www.hoops3d.com/ 


Text Analysis International has an- 
nounced the beta release of VisualText, 
an IDE for building deep text analysis 
applications. It introduces NLP++, a C++- 
like programming language designed for 
building deep text analyzers. VisualText 
is designed to aid development of a range 
of text analyzer applications, including 
information extraction, categorization, 
and summarization. Text analyzers can 
be developed and deployed in either in- 
terpreted or compiled modes, and a run- 
time API supports integration with broad- 
er applications. 

Text Analysis International Inc. 

1669-2 Hollenbeck Avenue, # 501 

Sunnyvale, CA 94087 

650-417-2059 

http://www.textanalysis.com/ 


Version 7.5.1 of Data Junction’s Integra- 
tion Engine, a real-time integration in- 
frastructure, includes XML support for 
Linux, UNIX, and Win32 environments. 
The Integration Engine provides support 
for all popular enterprise database servers 
on the Linux platform, including MySQL, 
as well as full support for all XML for- 
mats and standards. Java, COM, and C 
APIs are included, and also a command- 
line interface for integration with other 
applications. 

Data Junction Corp. 

2201 Northland Drive 

Austin, TX 78756 _ 

512-459-1308 

http://www.datajunction.com/ 
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SWAINE’S FLAMES 





Flaw and Order 


If the law supposes that, the law is a ass—a idiot. —Charles Dickens 


n May 25 of this year, semiconductor legend Gordon Moore announced he would step down from 
the board of directors of Intel. The most far-reaching effect of this decision is the immediate repeal 
of Moore’s Law, which Moore formulated back in 1965, and which stated that the number of 





0 


transistors that could be packed into an integrated circuit would double every year (later revised to every 
year-and-a-half to two years). The implication of the repeal of Moore’s Law for society, technological 
progress, stock market, and chairmanships of key Senate committees has yet to be assessed. 

Moore is not just going fishing, though. He has set up a $10 billion fund that will back some 
interesting scientific and environmental efforts. Based on these interests of Moore’s, I predict we will 
shortly see some new Moore’s Laws. 





¢ Moore’s Law of Species Extinction. Every two years, the number of species on the Earth will halve. 
If we work really hard on genetic engineering and develop the ability for humans to extract nec- 
essary nutrients from sterile rock and solve the oxygen problem that we’ll face when the last plant 
is gone and reengineer the human digestive system so that it doesn’t require the presence of all 
those microorganisms, we should be able to get the species count down to one. Beyond that, 
Moore’s Law of Species Extinction may not hold precisely. 

Moore’s Law of Distributed Computing. Every two years, the number of people involved in the 
SETIonline project will double. SETI stands for “Search for Extraterrestrial Intelligence,” and the 
project uses spare CPU cycles to crunch data from space for anomalies that could indicate intel- 
ligent life elsewhere in the universe. Eventually, every person on Earth will be giving all the spare 
cycles in all their computers and game machines and embedded processors to SETIonline, and 
then in order for Moore’s Law of Distributed Computing to continue to hold, we'll have to get 
the aliens involved. 

Moore’s Law of College Enrollment. Enrollment in the engineering programs at Stanford, Cal Tech, 
and MIT will double every two years. In the fall of 2013, one million freshmen will enroll in engi- 
neering at Stanford, packing the hills all the way up to Woodside to a density more reminiscent 
of Bangladesh than Palo Alto. In 2017, when they all go back to their home countries, four mil- 
lion new students will replace them. 


Other popular laws have also been repealed in recent years, often with less fanfare than Moore’s 
Law. I don’t know why it is, but in publishing and in government, retractions never get the same 
coverage as the thing retracted. But as a service to DDJ readers, I'll clarify the status of these recent- 
ly retracted or redacted or repealed or otherwise tweaked laws. 


¢ Grosch’s Law. This states that computing power is proportional to the square of system cost. This 
law (P=kC'**2) was shown to be valid for an extreme value of C by the free-computer phenome- 
non of some months ago, although it has been pointed out that “free” there referred to price rather 
than cost. Grosch’s Law failed badly, however, in a recent experiment in which an ISV was actu- 
ally paid to use a particular vendor’s defective parts, resulting in a negative system cost, but the 
negative value for C failed to generate, when squared, a positive value for P. 

e The Law of Evolution by Natural Selection. Amended. It is now the Law of Evolution by Self-Selection 
through the Drinking of Poisoned Kool-Aid or Going off to Join the Aliens in the Tail of the Comet. 

e The Law of Conservation of Angular Momentum. Rescinded by Executive Order, President William 
Jefferson Clinton. Spin can now be infinite. 

e Foster’s Law. Flouted. This law, which says that all Australians in the media must maintain the 
image of beer-swilling male chauvinist pigs, was blatantly flouted by “Xena, Warrior Princess” star 
Lucy Lawless. Well of course she would. 

e The Law of Large Numbers. Still in effect, but made more accessible to the general public by defin- 
ing “large” to mean 6. Interested readers can find more details in the forthcoming “Dummies Guide 
to Large Numbers.” 

e Newton’s Third Law of Motion. For every action there is an equal and opposite reaction. Status 
uncertain. Currently under test by Senator James Jeffords of Vermont. 

e The Law of the Excluded Middle. Status uncertain. Also currently under test by Senator James 


Jeffords of Vermont. 
¢ 


Michael Swaine 
editor-at-large 
mike@swaine.com 
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Medical Pccoitls, credit card information, HR databases or applications you’re building - RSA BSAFE® software is your 
code of choice. Quickly build certificate handling, public key cryptography, secure client-server communication or encryption 
into virtually any wired or wireless application or device. Complex programming made simple. That’s why RSA BSAFE® 
software is found in nearly a billion applications worldwide. RSA Security, your source for authentication, 
encryption and PKI. To qualify for a free RSA BSAFE® SDK, go to www.rsasecurity.com/go/dna 
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The Most Trusted Name in e-Security 
www.rsasecurity.com 
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RSA BSAFE is a registered trademark and RSA Security, “The Most Trusted Name in e-Security” and the RSA Secured logo are trademarks of RSA Security Inc. ©2001 RSA Security Inc. All rights reserved. 





~ InstallAnywhere..., zero c: 


InstallAnywhere does not discriminate on the basis of platform. 
programming language, or developer preference. 


As a developer, you have the right to create the = made InstallAnywhere the most popular multi- 
world's most powerful platform-independent | platform installer. Build politically correct 
deployment solutions. InstallAnywhere gives Ralsiecl lic) acu) 

you the freedom to deploy anything, anytime, 

to any platform. It's no wonder companies like | WWwW.ZeroG.com/installer 


Lucent, BEA, Borland, Cisco, and Nortel have 1.415.512.7771 





2001 Zero G Software, inc instaliAnywhere and Zero G are trademarks or registered trademarks of Zero G Software, inc. All other trademarks are property of thei respective owner 





